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Thank you for sharing 


Initiatives to make genetic and medical data publicly available could improve diagnostics — but 
they lose value if they do not share with other projects. 


flow of information through the Internet. Pictures, articles, jokes, 
links, ideas, criticism — information has never been so free to 
move. And from open access to giant web-based data repositories, 
science in 2015 is increasingly based on shared knowledge and expertise. 

Sharing should be equal, but some is more equal than others. The 
principals behind one genetic data-sharing project unveiled last week 
have described their initiative as a model of “scientific openness” that 
offers “broader access” to genetic data. Indeed, the name of the project 
— BRCA Share — trades on the idea of data freedom. The initiative 
focuses on clinical data concerning mutations in the genes BRCA1 and 
BRCA2, which increase risk of breast and ovarian cancer. 

In truth, it creates more of a walled garden of genetic data than 
an open field. That runs contrary to crucial ongoing efforts to amass 
large amounts of linked genetic and medical data to help scientists and 
doctors to improve interpretation of genetic test results. 

The commercial market for genetic testing for breast cancer is 
growing. On 21 April, for instance, a firm called Color Genomics in 
Burlingame, California, promised to offer women BRCA gene testing 
— which normally costs thousands of dollars — for as little as US$249. 
If the companies that offer such testing share their data, stripped of 
identifying information, with researchers, it could aid efforts to under- 
stand how all of the thousands of possible BRCA mutations affect 
cancer risk. The more data researchers can gather, the more they can 
determine whether ‘variants of unknown significance’ — genetic dif- 
ferences whose health effects are unknown — are benign or pose risks. 

BRCA Share is a partnership between Quest Diagnostics of Madison, 
New Jersey, and INSERM, the French National Institute of Health and 
Medical Research. Testing labs that sign up will get access to each other’s 
BRCA data and an INSERM repository of information about the genes. 
The project will run functional studies on the effects of mutations. 

But it will not share data with similar efforts such as ClinVar, a 
US National Institutes of Health-funded initiative that is making linked 
genetic and medical data publicly available for all. Quest says that BRCA 
Share cannot contribute to ClinVar because its data are structured dif- 
ferently. But this seems a weak excuse. Researchers say that increasingly, 
where there is a will to share, a technological way can be found. Because 
of the restrictions, several labs have declined to join BRCA Share. 

Quest says that participants in BRCA Share will be allowed to share 
their own data with Clin Var. But the first to join the initiative — diag- 
nostics company LabCorp of Burlington, North Carolina — has so far 
chosen not to; it has not deposited any BRCA data in ClinVar. Quest 
says that it will share BRCA and other genetic data with the Human 
Variome Project and the Leiden Open Variation Database, based in 
the Netherlands, which has an agreement to share data with Clin Var, 
but the agreement between Quest and Leiden has not yet been signed. 

The episode showcases an uncomfortable truth about personalized 
medicine: everyone agrees that large data sets are crucial, and everyone 


Te modern world is all about sharing, driven by the borderless 


is racing to collect them. The larger the data set, the more useful. The 
most useful of all would be one huge database containing all available 
data. But even though all parties recognize the value of it, many are 
choosing not to share, and this holds back medical progress. 

BRCA Share is only one example; there are many others. The 
company 23andMe in Mountain View, California, for instance, has 
collected genetic data on 900,000 people. It shares aggregate statistics 

with outside researchers and has published or 


“Even though contributed to 32 papers in the past 5 years. 
all parties But to protect users’ privacy, it does not rou- 
recognize the tinely share linked genetic and medical data. 
value of it, many The field is young and will take time to 
arechoosing not _ settle. Already, dissatisfaction with limits 
to share.” on sharing is driving new initiatives, such as 


Genes for Good at the University of Michigan 
in Ann Arbor, which is enlisting Facebook users for a genetic study (see 
page 597), and the online community Open Humans, which is helping 
participants in genetic studies to share their data with other researchers. 

Awareness of the issue among consumers is also driving companies 
to share more widely. Color Genomics, for instance, promises to share 
its BRCA data with ClinVar, as do many other companies. 

The imperative to share data remains an esoteric issue for much of 
the public, and one that is easily obscured. Companies or research- 
ers who talk the talk of sharing but do not actually walk the walk 
should be challenged. Data sharing is too important to be turned into 
meaningless marketing speak. = 


A hard sell 


Scientists must stand up for marine parks if the 
value of the seas is to be recognized globally. 


England's southern coastline, is both a beauty spot and the site of 
“the most popular naturist beach in Britain”. More importantly 
for Nature-ists, the seagrass that thrives in the bay’s shallow waters is 
home to rare sea-horse populations. But keeping it that way is a grow- 
ing challenge, because Studland is also a playground for the wealthy 
and powerful, including some yacht-club members who like to anchor 
their boats there, to the possible detriment of the natural habitat. 
Scientists can make the case for conservation, but the value of 
marine sites such as Studland is hard to sell. It is not the Great Barrier 
Reef with its miles of coral, or the Galapagos Islands and their sharks, 
turtles and marine iguanas. It is not even Lundy — the island at the 


S tudland Bay is an unlikely battleground. The sandy shore, part of 
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centre of the first UK marine park, home to seals and puffins. 

Last week, the UK government ended a public consultation on the 
latest round of its attempts to manage conflicts between conservation 
and other maritime interests. Officials plan to create dozens of marine 
conservation zones (MC7Zs) around the UK coast, to introduce an extra 
level of protection for worthy sites. The move is part of a global effort 
to extend conservation measures on land to the coast and open seas. 

But, as on land, conservation of marine areas is not popular with 
everyone, and the ambitious UK plans are in danger of running 
aground. Already, proposals for the Studland MCZ have been dropped 
for now, and plans for others are in danger. In fact, marine experts who 
originally identified 127 sites for MCZs around the British coastline 
now fear that, in the worst case, barely more than half will be realized. 

The UK process sets up these zones in three waves. The first 
saw 31 proposed MCZs reduced to 27, which were designated in 
November 2013. A second tranche of designations initially featured 
37 sites, but was whittled down to 23 even before the public consulta- 
tion closed last week. (Studland Bay MCZ was one of the 14 that were 
chopped.) The third wave is meant to be set up before 2016. 

The United Kingdom is squaring up for a general election on 7 May, 
and even long-serving politicians are nervous about keeping their 
seats. Marine conservation is low on the agenda — especially the sort 
that annoys boat-owning voters. But the next government will have to 
take some tough decisions on marine protection. Because of the way 
the first two tranches of MCZ designation have progressed, the third 
will be left with difficult decisions on controversial sites that have been 
kicked into the long (sea)grass by the earlier rounds. These include 


not only Studland but also the important deep-water mud habitats of 

the Celtic Deep, and another sea-horse habitat off the Isle of Wight. 
Crucially for the United Kingdom, failures at home are undermining 
its potential to show global leadership on marine conservation. Huge 
marine parks have been designated around UK territories such as the 
Chagos Islands in the Indian Ocean, and another was announced on 
18 March for the Pitcairn Islands in the south Pacific. Significant ques- 
tions remain over enforcement and monitor- 


“The UK ing of these overseas reserves, and must be 
government solved if they are not to become what scien- 
deserves credit tists call ‘paper parks’ — just a line on a map 
forestablishing rather than something that does any good. 


But the UK government deserves credit for 
establishing these parks at all. This is despite a 
recent setback for the Chagos reserve — a judge ruled last month that 
the park had been improperly declared by Britain because Mauritius, 
which has fishing rights in the archipelago, had not been fully consulted. 

Later this year, the finalized United Nations Sustainable Devel- 
opment Goals are likely to include a commitment to protecting the 
oceans. Making good on this and on existing promises will require 
governments to disappoint some powerful vested interests that would 
prefer to keep access to marine areas unrestricted. And if rich nations 
wish to push the rest of the world to protect its marine wonders, they 
must first get their own houses in order. Scientists can help by pro- 
moting the value of places that lack the brand recognition of the Great 
Barrier Reef. If the world’s seas are really going to be protected, mud 
and seagrass will have to be considered alongside tropical reefs. = 


the parks at all.” 


More on unicorns 


A newly discovered tiny dinosaur sported an 
intriguing structural accessory. 


let’s say that you have found your unicorn, but then what? 

How would you know that it was a unicorn, rather than some- 
thing else? How diagnostic, say, is that horn? Might there not be other 
creatures, as yet unimagined, that have unicorn-like horns but differ 
from unicorns (and everything else) in all kinds of unusual ways? 
Narwhals, for example, exist — and their unicorn-like horn is the 
only feature they have in common with unicorns. So what else might 
lurk out there in the great unknown? You know that unicorns don’t 
number among the domestic beasts, says Jorge Luis Borges in his essay 
‘Kafka and his precursors’, but, without seeing one, how do you know 
what a unicorn is “like”? 

Palaeontology is always about hunting for unicorns. Because 
fossils sample diversity from the past that might not exist today, there is 
always a chance that one will dig up something that defies categoriza- 
tion. Such is the case for the tiny dinosaur described online in Nature 
by Xing Xu and colleagues (see go.nature.com/jsxjxv), and discussed 
further by Kevin Padian (go.nature.com/s6g2aw). 

The dinosaur is about as far from Brontosaurus and other 
behemoths familiar to the museum visitor as might be imagined. 
It belongs to a little-known group of creatures whose body size has 
hitherto scaled inversely with the length of their names. It is only the 
third known example ofa (deep breath) scansoriopterygid, the other 
two bearing the unwieldy handles Epidendrosaurus and Epidexipteryx. 
Xu and colleagues buck the trend by calling their creature Yi. Or, in 
full, Yi qi. This must be the shortest dinosaur name ever, and it is com- 
mensurate with the tiny size of scansoriopterygids, which could have 
looked thrushes or starlings in the eye. Although scansoriopterygids 
had feathers and cluster phylogenetically round the ancestry of birds 


H ow do you go about hunting for unicorns? More specifically, 
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and other feathered dinosaurs, they do not seem to have had flight 
feathers on their disproportionately long forelimbs. Reconstructions 
make them out to be rather like feathered lemurs, scampering along 
branches, perhaps wheedling insects out of crevices with their long, 
clawed fingers. 

Except that Yi qi is different. Attached to each wrist is a strut, made 
of bone or calcified cartilage, which cannot simply be homologized 
with regular wrist or hand bones. The strut seems to be a new structure 
made from an accessory wrist bone, possibly a sesamoid — the kind 
of bone usually embedded in a tendon or muscle. The ‘thumb of the 
giant panda is made of just such a bone. The sesamoid of Yi qi (if that’s 
what it is) is much larger, however, in relation to the animal as a whole 
— equal in length to the bones of the forearm. It had to be there for a 
reason, but what was it? 

Itis here that we enter unicorn territory — for no dinosaur, however 
unusual, has been found with anything like this feature. The authors are 
appropriately cautious, therefore, in their interpretation. They point to 
the hint of a suggestion that some soft tissue, preserved alongside these 
curious elements, represents what might have been a membrane that 
the sesamoid bone supported. From that, they suggest that Yi qi had 
membranous wings and might have glided from branch to branch, in 
much the same way as various tree-living mammals and reptiles do 
today. But it was probably not capable of powered flight as birds and 
bats are — and as were, presumably, the extinct pterosaurs, which were 
(one must stress) only distant relatives of dinosaurs and birds. 

When, in the mid-1990s, the first dinosaurs with preserved 
feathers came to light, there was a great deal of celebration. However, 
evidence for the bird-like nature of dinosaurs had been accumulat- 
ing for a while, so for many (though not all) people, feathered dino- 
saurs were a vindication rather than a challenge. Yi qi, by contrast, is 
something else. Here we have a feathered dinosaur and a close relative 
of birds that seems to have essayed an entirely different experiment 
in aerial locomotion. For a feathered dinosaur 
to have traded feathers for a membrane in an 
aerofoil is something nobody could have pre- 
dicted. Whether or not it is a unicorn has yet to 
be determined. m 
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databases, data sharing and increased access to technical 

resources. Until recently, this trend towards open science was 
helping scientists in China to compete on the world stage. However, 
growing constraints on public-data exploration and sharing are being 
felt throughout Chinese society. If not addressed, they will slow scientific 
research and innovation. 

It is hard — and getting harder — for Chinese scientists to access 
high-quality domestic data. Most of the public data are held by govern- 
ment departments, some of which are strengthening their monopoly 
and making it harder for researchers to access the information. This 
affects researchers in the humanities and social sciences especially, but 
also extends to fields such as environmental science and public health, 
because the data involved can be politically sensi- 
tive. At conferences, I hear numerous complaints 
from colleagues about how hard it is to extract 
routine figures such as air-pollutant levels from 
the authorities, for example. 

Even when data are published, some are likely to 
be of poor quality because they have not been col- 
lected properly. The most notable example is the 
controversy on China's gross domestic product 
(GDP). There is a significant — and widening — 
difference between the official national estimate 
and the total calculated by adding up the GDPs of 
each of China's 31 province-level divisions. The 
National Bureau of Statistics in Beijing admits that 
different data-collection methods are used at the 
provincial level, and is trying to harmonize them. 
So far, progress has not been encouraging. 

Public data sharing has been turned into a 
profit-making scheme. It would be useful, for example, to compile data 
on pollution from road vehicles in China. Done properly, this would 
require access to detailed records on the number of each type of vehicle 
licensed, road congestion, detailed engine parameters and fuel stand- 
ards. Research institutions struggle to get even basic figures on vehicle 
ownership from public agencies, so they must use less rigorous — and 
often misleading — sales data that industry groups collect from manu- 
facturers. Ironically, the same wealthy automobile manufacturers that 
inflate their own numbers can get objective, reliable data about their 
competitors’ sales by buying them from special channels that are linked 
with some government departments — at a price that public institutions 
and scientists cannot afford. 

In such an environment, it is no surprise that some research teams 
in China do not want to publish their own data. 


Te era of big data has seen the development of bioinformatics 
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China’s scientific progress 
hinges on access to data 


Itis getting harder for scientists in China to obtain the high-quality public 
data that they need for important research studies, says Zheng Wan. 


as port statistics and fleet information, that we have compiled, but we are 
reluctant to share the information. The workload and cost of sourcing 
and sorting scattered data sets into a usable form are enormous. If we 
keep these data exclusive, we can use them to develop research papers. If 
it were easier to access good data from other sources, we would be more 
comfortable with giving our own away. 

Open access to and improved quality of public data can promote 
transparency in government affairs. Despite the slow progress, there 
are positive examples of improved transparency and howit has benefited 
Chinese society. A notable one is the full disclosure of air-pollution data, 
which started in 2014 as a result of mounting public pressure on environ- 
ment bureaus. Before this, only sketchy data were published on a daily 
basis. These data are now updated hourly and are widely shared between 
government agencies in China. The data allow the 
health bureau to send alerts to the general public, 
the education bureau to decide whether classes 
should be suspended during smoggy days and the 
transportation bureau to adjust its vehicle-restric- 
tion rules. Perhaps the most important effect of 
the full disclosure is heightened public awareness 
of the worsening pollution crisis. Unfortunately, 
not all data that interest scientists also interest the 
general public. There is still a long way to go in 
achieving full transparency and increasing the 
availability of public data. 

The restrictions are not confined to informa- 
tion generated and held inside China. Foreign 
academic resources can also be technically 
demanding to access. Several information- 
management bureaus have set up digital road- 
blocks to filter supposedly harmful information. 

My life as a working scientist in China is affected. Reliable searches of 
the academic literature are near impossible. With no access to Google 
Scholar — which I prefer over other search engines, because it com- 
bines books, papers, theses, patents and technical reports — I have to 
keep track of trends by individually searching the databases operated 
by publishers that are still accessible. 

Secure networks are crucial for national security, but good data are the 
backbone of scientific progress and economic development. Resource 
sharing and public access to trustworthy information underpin eco- 
nomic and social well-being. In China, resolving these conflicts will 
entail a comprehensive study, so that we can establish a highly efficient 
and reasonable data-management mechanism that benefits all. More 
immediately, researchers should be granted greater access, especially 
to public data and to academic search engines. m 


Zheng Wan is a research scientist at the College of Transport and 
Communications, Shanghai Maritime University, China. 
e-mail: mrwan@ucdavis.edu 
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Light direct from 
an alien world 


Astronomers have spotted light 
reflected off a planet orbiting 
a distant sun, by teasing it 
out from the background 
starlight. The discovery allows 
direct calculations of the mass 
and other properties of the 
exoplanet, rather than inferring 
them using other methods. 
Jorge Martins of the 
University of Porto in Portugal 
and his team used the HARPS 
instrument at the European 
Southern Observatory’s 
La Silla facility in Chile to 
study 51 Pegasi b, which was 
the first ever exoplanet found 
around a Sun-like star. The 
team subtracted the starlight, 
leaving only the faint planetary 
spectrum. From this, they 
calculated the planet’s mass to 
be about half that of Jupiter. 
Detecting this reflected 
light from other exoplanets 
could allow astronomers to 
determine more characteristics 
than other techniques can. 
Astron. Astrophys. 576, A134 
(2015) 


Bacteria swim to 
form crystals 


Fast-swimming bacteria 

that live at the bottom of salt 
marshes spontaneously come 
together to form organized 
crystalline structures that move 
through water. 

The bacterium Thiovulum 
majus is a large, round cell with 
hundreds of small flagella, or 
tail-like structures, that spin to 
propel the organism. Alexander 
Petroff of the Rockefeller 
University in New York and 
his colleagues observed the 
behaviour of individually 
spinning cells under a 
microscope and found that 
each created tornado-like flows 


RCH HIGHLIGHTS 


Selections from the 
scientific literature 


BIOMECHANICS 


How grebes walk on water 


The heaviest animals known to run on water 


team analysed high-speed video of wild birds 


pull off the feat by using quick strides and large 
feet that slap the surface. 

Western and Clark's grebes (Aechmophorus 
occidentalis and Aechmophorus clarkii; pictured) 
run as far as 20 metres on water and for up to 
7 seconds during a mating ritual, and are among 
only a few animals with this ability. Glenna 
Clifton at Harvard University’s Concord Field 
Station in Bedford, Massachusetts, and her 


performing the dance and studied models of the 
grebe foot in the laboratory. 

They found that the birds stay above the water 
by having a fast stride rate of up to 20 steps a 
second, as well as wide, flat feet that slap the 
water surface with enough force to support up 
to 55% of the bird’s weight. The animal's feet are 
also shaped to reduce drag. 

J. Exp. Biol. 218, 1235-1243 (2015) 


in the liquid around them. 
These forces cause the cells to 
self-assemble into groups of 
between 10 and 1,000 cells, 
arranged in a two-dimensional 
crystalline hexagonal shape 
that rotates collectively. 

Phys. Rev. Lett. 114, 158102 (2015) 


HYDROLOGY 


Groundwater 
under Antarctica 


A groundwater network found 
beneath an Antarctic valley 
could support microbial life. 

jill Mikucki of the University 
of Tennessee in Knoxville, 
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Slawek Tulaczyk at the 
University of California, Santa 
Cruz, and their colleagues 
studied Taylor Valley using 
an airborne instrument 

to measure underground 
electrical resistivity, which 
increases as water in soil 
freezes. The team discovered 
two systems of briny 
groundwater, one of which 
connects to the Ross Sea. 

This water could be home 
to microbes, the authors 
suggest, because subsurface 
temperatures at boreholes 
were within the range that can 
support microbial life (-3 °C 
to -20°C). And an outflow of 
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iron-containing subsurface 
saltwater from the Taylor 
Glacier is known to contain 
microorganisms. 

Nature Commun 6, 6831 (2015) 


| ___NEUROSCIENCE 
Brain waves go far 
in tinnitus 


Many people hear ringing in 
the ears, known as tinnitus, 
and researchers now think 
it could involve abnormally 
linked brain waves that extend 
across a large part of the brain. 
William Sedley at Newcastle 
University, UK, Phillip Gander 


SUSAN BISSELL 


NAT. HIST. MUS., LONDON/SPL 


at the University of Iowa in 
Iowa City and their team 
recorded brain activity in an 
awake 50-year-old man with 
tinnitus, who had electrodes 
implanted to study his epilepsy. 

The researchers used sounds 
to temporarily suppress the 
loudness of the tinnitus, and 
looked for shifts in brain- 
wave patterns as the tinnitus 
intensity changed. They found 
that low-frequency brain waves 
previously linked to tinnitus 
spread throughout a larger 
proportion of the auditory 
cortex than was thought. 

Mid- and high-frequency 
oscillations affected brain 
regions involved in auditory 
memory and perception. 

The three sets of brain 
waves seem to interact to 
create a widespread system of 
abnormal activity. 

Curr. Biol. http://doi.org/338 
(2015) 


METROLOGY 


Atomic clock 
smashes records 


The world’s best atomic 

clock has become even more 
accurate, neither gaining nor 
losing a second over 15 billion 
years — longer than the age of 
the Universe. 

The clock keeps time using 
2,000 ultracold strontium 
atoms trapped in a laser lattice. 
Its pendulum is a laser that has 
a frequency tuned to resonate 
with that of the photons that 
are released and absorbed 
by the atoms as they switch 
between two energy states. 

By using a more stable laser 
and by reducing the effects of 
environmental perturbations, 
Jun Ye at the JILA institute in 
Boulder, Colorado, and his 
team made the ‘ticks’ of the 
timepiece more uniform. This 
increased the clock’s accuracy 
by three times compared with 
the previous record. 

The result brings strontium- 
based optical clocks closer to 
replacing the current standard 
for time measurement, the 
caesium fountain clock, which 
makes lower-frequency atomic 
transitions. 

Nature Commun. 6, 6896 (2015) 


CLIMATE CHANGE 


Weather extremes 
linked to warming 


Global warming is responsible 
for three out of four hot 
temperature extremes around 
the world, and almost one- 
fifth of heavy precipitation 
events over land. 

Erich Fischer and Reto 
Knutti at the Swiss Federal 
Institute of Technology 
in Zurich used climate 
simulations to compare the 
frequencies of daily hot and wet 
extremes over the past century 
with those projected for the 
twenty-first century under a 
high-emissions scenario. 

They found that with 2°C of 
warming above pre-industrial 
levels, extreme events that were 
once expected to occur every 
30 years or so might happen 
every 10-20 years. 

Nature Clim. Change http://dx.doi. 
org/10.1038/nclimate2617 (2015) 


PHYSIOLOGY 


Colour tunes the 
body clock 


The changing brightness of 
sunlight during dawn and 
dusk is known to adjust 
circadian rhythms, but 
researchers now show that 
the shifting colour of the light 
does the same thing in mice. 
Timothy Brown, Robert 
Lucas and their co-workers at 
the University of Manchester, 
UK, studied mice that were 
genetically engineered so 
researchers could stimulate the 
two colour-detecting pigments 
in the mouse retina one ata 
time. When they recorded the 
firing of neurons in the brain’s 
clock centre, they found that 
this activity was in sync with 
changes in the colour of light 
between blue and yellow. 
Mice exposed to lighting 
conditions that mimic 
twilight, including 
colour changes, showed. 
differences in the timing of 
their daily body-temperature 
fluctuations, compared 
with animals exposed to 
light that shifted only in 
brightness. The authors 


RESEARCH HIGHLIGHTS BiiiSaiaa¢ 


Popular articles 
on social media 


SOCIAL SELECTION 


When bar graphs go bad 


Dodgy data analysis has come under fire in recent months 
(Nature http://doi.org/34p; 2015), and now an article in 

PLoS Biology takes aim at another staple of statistics. It argues 
that bar graphs used to describe a continuum of data are 

often uninformative and misleading, and should be purged 
from much of the scientific literature. The stance gained 

wide support from commenters on social media. “Hiding 
individual data points behind bar plots makes for ineffective 
graphics. Show the data!” tweeted Trevor Bedford, a virologist 
and computational biologist at the Fred Hutchinson Cancer 
Research Center in Seattle, Washington. “Everyone in the world 
with access to numbers and plot making software should have 
to read this paper,’ tweeted Tom Oates, a kidney specialist at 
Imperial College Healthcare NHS Trust in London. 

PLoS Biol. 13, e1002128 (2015) 


Based on data from altmetric.com. > NATURE.COM 
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suggest that the mammalian the genomes of a woolly 


mammoth (Mammuthus 
primigenius; artist’s impression 
pictured) that roamed Eurasia 
roughly 45,000 years ago, and 
of one that lived on a remote 
island some 4,000 years ago, 
just before the animals went 
extinct. Using the genomes to 
infer population trends, the 


circadian clock uses colour 
to gauge the Sun’s position 
during twilight. 

PLoS Biol. 13, €1002127 (2015) 


PALAEOGENETICS 
Genomes reveal 


mammoth history 


Genome sequences from two 
woolly mammoths provide a 
rare look at the genetic events 
leading up to extinction. 

Eleftheria Palkopoulou and 
Love Dalén at the Swedish 
Museum of Natural History 
in Stockholm sequenced 


team found that mammoths 
declined around 300,000 years 
ago, then recovered until 
about 12,000 years ago when 
mainland populations went 
extinct, leaving just a few 
hundred island-dwelling 
animals. 

The island mammoth 
showed decreased genetic 
diversity, which could have 
contributed to extinction, the 
team says, adding that this 
kind of genetic analysis 
could help conservation 

biologists to assess how 
much diversity has been 
lost from endangered 
species today. 
Curr. Biol. http://doi.org/34d 
(2015) 


> NATURE.COM 

For the latest research published by 
Nature visit: 
www.nature.com/latestresearch 


30 APRIL 2015 | VOL 520 | NATURE | 589 


© 2015 Macmillan Publishers Limited. All rights reserved 


SEVEN DAYS nescninss 


Embryo editing 

The first ever report of genome 
editing in human embryos 
sparked debate on 22 April. 
Scientists in China used a 
technique called CRISPR/Cas9 
to edit the gene responsible 

for the blood disorder 
6-thalassaemia (P. Liang 

et al. Protein Cell http://doi. 
org/34q; 2015). The work has 
split opinion over the ethics of 
such research. A team led by 
Junjiu Huang, a gene-function 
researcher at Sun Yat-sen 
University in Guangzhou, 
applied the technique to 

86 embryos, but the faulty gene 
was replaced in only 4. Many 
embryos were found to have 
‘off-target, or unintended, 
mutations. The team used non- 
viable embryos, which could 
not have resulted in live births 
because they were produced by 
eggs fertilized by two sperm. 
See page 593 for more. 


Malaria vaccine 

A candidate malaria vaccine 
known as RTS,S/ASO1, which 
has been in development for 
nearly 30 years, has failed to 
prevent deaths in a trial of 
more than 14,000 toddlers and 
infants. The results, reported 
on 24 April (RTS,S Clinical 
Trials Partnership Lancet 
http://doi.org/34s; 2015), 
showed that the vaccine cut the 
rate of uncomplicated malaria 
by 26% in babies aged 6-12 
weeks who received a booster 
dose, and by 18% in those who 
did not. It had no effect on 
severe malaria. Regulators will 
now decide whether and how 
the vaccine should be used. 


Europe’s GM plan 
Individual member states of 
the European Union (EU) 
would be able to ban genetically 
modified (GM) crops 
otherwise approved for use 


Death and destruction in Nepal 


Widespread destruction and loss of life came 

to Nepal as a magnitude-7.8 earthquake hit 

just 80 kilometres northwest of the capital, 
Kathmandu, on 25 April. As Nature went to 
press, more than 4,000 people had been reported 
dead, although it is feared that the final death toll 
will be much higher. The quake happened along 


across the union, under plans 
by the European Commission. 
If adopted, legislation proposed 
on 22 April would allow 
countries to opt out of growing 
approved GM crops and to 
ban their importation for 
human food or animal feed. 
The EU pledged the reform in 
December 2014 (see Nature 
http://doi.org/xmq; 2014). 
Some researchers hope that 

it will allow such crops to be 
approved after years of stalling 
and debate between pro- and 
anti-GM nations. 


Thai fishing notice 
Fish imports from Thailand 
could be banned in much 

of Europe in a row over 

illegal fishing. The European 
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Commission issued a formal 
warning to Thailand on 

21 April, saying that the 
country was not doing enough 
to stop illegal fishing. Thailand 
has six months to improve 
monitoring, control and 
sanctioning systems, or it risks 
a trade ban. 


US energy review 
The United States must 
increase investment in its 
ageing energy infrastructure 
to maintain safety, reduce 
greenhouse-gas emissions and 
prepare for global warming, 
the US Department of Energy 
urged in its Quadrennial 
Energy Review on 21 April. 
The report recommends ten- 
year investments of between 
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one of the world’s biggest geological collision 
zones, in the Himalayas. Many older buildings 
with unreinforced masonry collapsed; temples 
and cultural heritage sites, including those in 
Bhaktapur (pictured), have been destroyed, says 
the United Nations Educational, Scientific and 
Cultural Organization. 


US$2.5 billion and $3.5 billion 
to replace and maintain oil 
and gas pipelines; $3 billion 

to $5 billion for programmes 
to ensure energy reliability; 
and $3.5 billion to modernize 
the electricity grid. The White 
House also announced a new 
partnership with major energy 
suppliers, intended to bolster 
energy infrastructure against 
extreme weather and climate 
change. 


Hawaiian protest 
Protests over the Thirty 
Meter Telescope (TMT) 
escalated across Hawaii last 
week, as Native Hawaiian 
demonstrators spoke out 


NAVESH CHITRAKAR/REUTERS 


DAVID CORTES SEREY/AFP/GETTY 


SOURCE: CENTER FOR TOBACCO CONTROL RESEARCH AND EDUCATION 


against the mega-telescope 
planned for construction 

on the sacred mountain of 
Mauna Kea. The project has 
legal authority to proceed, but 
has held off since early April 
because of the controversy. On 
21 April, protestors marched to 
the offices of governor David 
Ige in Honolulu, asking for the 
TMT to be halted permanently. 
They say it isa desecration of 
native land. See go.nature.com/ 
hcb6yy for more. 


Transatlantic trek 


The world’s largest liquid- 
argon neutrino detector has 
found a new home at Fermilab 
near Chicago, Illinois, an 
international collaboration 
announced on 22 April. The 
ICARUS detector, which 
weighs 760 tonnes when full, 
will journey to the United 
States from the CERN 
particle-physics lab near 
Geneva, Switzerland, where 
it is being upgraded. Starting 
in 2018, ICARUS and two 
similar detectors will hunt for 
‘sterile neutrinos, candidates 
for dark matter. Between 
2010 and 2014, it was at the 
Gran Sasso underground lab 
in Italy, where it monitored a 
now-decommissioned beam 
of neutrinos shot through the 
Earth’s crust from CERN. 


Chilean volcano 


The Calbuco volcano in 
southern Chile erupted on 
22 April for the first time since 


TREND WATCH 


Young film-goers are still viewing 


large amounts of smoking on 


screen, according to an analysis of 
1,853 box-office hits released in 
the past 12 years. The report, by 
tobacco researcher Stanton Glantz 
at the University of California, San 
Francisco, and colleagues, shows 
that in 2014, 36% of films that can 


be viewed by children without 


an accompanying adult featured 
tobacco imagery. Although that 
was down from 68% in 2002, there 
was a greater frequency of tobacco 
incidents in films that did show 


smoking. 


1972. The blast sent an ash 
plume towering more than 

15 kilometres above the cities of 
Puerto Montt and Puerto Varas 
(pictured). Ash also fell across 
much of the neighbouring 
countryside. Calbuco is one 

of the most active volcanoes 

in the southern Andes, and 

in 1893-94 caused one of the 
biggest eruptions seen in Chile 
in recent history. 


Head-injury ruling 
AUS judge has approved a 
65-year settlement to resolve 
thousands of concussion- 
related lawsuits brought 
against the National Football 
League by retired American 
footballers. The 22 April 
decision is expected to cost the 
league about US$1 billion in 
total, with awards averaging 
$190,000 for conditions 
including neuromuscular 
diseases, cognitive impairment 


SMOKING ON FILM 


and certain cases of chronic 


traumatic encephalopathy 
(CTE), a degenerative brain 
disorder diagnosed only at 
autopsy (see Nature 518, 
466-467; 2015). The settlement 
excludes mood disturbances 
sometimes attributed to CTE, 
because of the difficulty of 
ruling out other causes. 


Sa ee 
Whistleblower win 


The US Environmental 
Protection Agency (EPA) 
illegally retaliated against 

a chemist, Cate Jenkins, 

who exposed a cover-up 

of pollution dangers at the 
World Trade Center site 

after the terrorist attacks 

in 2001, said the non- 

profit organization Public 
Employees for Environmental 
Responsibility on 21 April. 
The statement follows a ruling 


The proportion of youth-rated blockbusters that contain tobacco 


imagery has fallen since 2002. 
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Ratings according to the Motion Picture Association of America. G, general audience; 
PG, parental guidance; PG-13, parents strongly cautioned; R, restricted. 


SEVEN DAYS | THIS WEEK | 


3-7 MAY 

More than 2,000 space 
and Earth scientists 
present their findings 
at the American 
Geophysical Union's 
joint assembly in 
Montreal, Canada. 
Topics include craters, 
thunder, mining and 
wine production. 
go.nature.com/mlcmea 


5-7 MAY 

The Humans to Mars 
Summit takes place 

in Washington DC. 

The event explores the 
technical, scientific and 
policy hurdles involved 
in getting humans to the 
red planet. 
go.nature.com/pxvtgh 


last month by a judge at the 
US Labor Department that 
the EPA deliberately sought 
to conceal information about 
the case. The EPA reinstated 
Jenkins in May 2012, more 
than a year on from her 
dismissal, after an appeals 
board ruled that the EPA did 
not properly document its 
case (see Nature http://doi. 
org/34b; 2012). 


New NASA deputy 


After nearly two years, NASA 
has filled its vacant deputy- 
administrator post. Dava 
Newman was confirmed 

for the position by the US 
Senate on 27 April. As an 
aeronautics and astronautics 
scientist at the Massachusetts 
Institute of Technology in 
Cambridge, her research 

has focused on human 

space exploration, and she 

is credited with developing 
lightweight spacesuits to give 
astronauts more mobility. The 
position as NASA’s second- 
in-command had been vacant 
since Lori Garver stepped 
down in September 2013. 
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Scientists in 
Egypt, Tunisia and Syria 
face major obstacles p.604 


Human embryos are at the centre of a debate over the ethics of gene editing. 


IN ] 


ON 


Embryo editing 
sparks epic debate 


In wake of paper describing genetic modification of human 
embryos, scientists disagree about ethics. 


BY DAVID CYRANOSKI AND SARA REARDON 


reported that they have used powerful 
gene-editing techniques to modify 
human embryos. Their paper’, published in 
the Beijing-based journal Protein & Cell on 
18 April, came as no surprise to the scientific 


lE a world first, Chinese scientists have 


community, but it has ignited a wide-ranging 
debate about what types of gene-editing 
research are ethical. The publication also raises 
questions about the appropriate way to publish 
such work (see ‘Publishing policy’). 

In the paper, researchers led by Junjiu 
Huang, a gene-function researcher at Sun 
Yat-sen University in Guangzhou, describe 


how they used a system of molecules called 
CRISPR/Cas9, known for its ease of use, to cut 
DNA in human embryos and then attempted 
to repair it by introducing new DNA. 

In a deliberate attempt to head off ethical 
concerns, the team used non-viable embryos 
obtained from fertility clinics, in which eggs 
had been fertilized by two sperm and so could 
not result in a live birth. 

Gene-editing techniques such as those that 
rely on CRISPR/Cas9 had previously been 
used to modify DNA in adult human cells and 
animal embryos. Earlier this year, rumours 
were circulating that the methods were being 
applied in human embryos too, but the Huang 
paper is the first published report of this. 

The team used CRISPR/Cas9 to modify a 
gene that can cause a potentially fatal blood dis- 
order called B-thalassaemia when it is mutated. 
Some researchers have suggested that such a 
procedure, if done in a viable embryo, could 
eradicate devastating genetic diseases before a 
baby is born. Others say that such work crosses 
an ethical line: in response to the rumours that 
the work was being carried out, researchers 
warned in Nature’ and Science’ in March that 
because the genetic changes to embryos — a 
procedure known as germline modification 
— are heritable, they could have an unpredict- 
able effect on future generations. 

Researchers have also expressed concerns 
that any gene-editing research in human 
embryos could be a slippery slope towards 
unsafe, unethical or non-medical uses of the 
technique. 


SERIOUS OBSTACLES 

Huang’s team says that its results reveal serious 
obstacles to using the method in a clinical 
setting. The team injected 86 embryos with 
CRISPR/Cas9, along with other molecules 
designed to add in new DNA. The research- 
ers then waited 48 hours, by which time the 
embryos would have grown to about eight cells 
each. Of the 71 embryos that survived, 54 were 
genetically tested. This revealed that just 
28 were successfully spliced, and that only 4 of 
those contained the genetic material designed 
to repair the cuts. “That’s why we stopped,” says 
Huang. “We still think it’s too immature” 

His team also found a surprising number 
of ‘off-target’ mutations assumed to be intro- 
duced by the CRISPR/Cas9 complex acting on 
other parts of the genome. The effect is one 
of the main safety concerns surrounding 
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PUBLISHING POLICY 


Gene editing poses challenges for journals 


Amid the discussion generated by a 

paper that reports gene editing in human 
embryos, the process behind its publication 
has also aroused curiosity. 

Lead author Junjiu Huang of Sun Yat- 
sen University in Guangzhou, China, says 
that the paper, published on 18 April in 
the Beijing-based online journal Protein & 
Cell, was rejected by Nature and Science, 
in part because of ethical objections. 

Both journals keep details of their review 
processes confidential (Nature’s news team 
is editorially independent of its research 
editorial team), but acknowledge that gene- 
editing of human embryos is a complicated 
issue for them. 

“This is a rapidly evolving and complex 
area for which we cannot — and should not 
— easily offer simplistic policies,” says Ritu 
Dhand, editorial director for Nature. Nature 
Publishing Group is consulting with a range 
of experts to develop a “progressive policy” 
on the issue, she says. 

Science, meanwhile, told Nature’s news 
team: “We believe strongly that the potential 
of genome editing must be viewed in terms 
of social mores and that the path forward 
must be developed through a consensus- 
building process.” 

The editors of Protein & Cell say that they 
published the paper to “sound an alarm” 
about such work. “In this unusual situation, 
the editorial decision to publish this study 
should not be viewed as an endorsement 


> germline editing because these unintended 
mutations could be harmful. 

The rates of such mutations were much 
higher than those observed in gene-editing 
studies of mouse embryos or human adult 
cells. And Huang notes that his team prob- 
ably detected just a subset of the unintended 
mutations because their study looked at only a 
portion of the genome known as the exome. “If 
we did the whole genome sequence, we would 
get many more,’ he says. 

Huang wonders whether there might be 
something intrinsically different that makes 
the human embryo more susceptible to extra 
mutations than animal embryos are. Another 
possibility — suggested by some critics of the 
work, he says — is that CRISPR/Cas9 worked 
differently in the embryos that his team used 
because they were the product of two sperm 
fertilizing an egg. 

For some, these technical challenges support 
arguments for a moratorium on all research 
on human germline modification. “I think 
the paper itself actually provides all of the 
data that we kind of pointed to,” says Edward 
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of this practice nor an encouragement of 
similar attempts,’ wrote Xiaoxue Zhang, 
managing editor at Protein & Cell, in an 
editorial published on 28 April (X. Zhang 
Protein Cell http://doi.org/35n; 2015). “We 
had serious discussion about the ethics 

of this issue,” adds the journal’s editor-in- 
chief, Zihe Rao. “We expected there might 
be difference of opinions, but it needs to be 
published to start discussion.” 

Springer, the publisher of Protein & Cell, 
confirmed that the journal had checked the 
researchers’ institutional approval and the 
consent forms from the embryo donors. 
They also confirmed that the study was 
compliant with the Helsinki declaration on 
human-medical-research ethics and with 
Chinese law. 

The paper sped through Protein & 

Cell’s review process: it was submitted 
on 30 March and accepted on 1 April. A 
spokesperson for Springer said that the 
paper was submitted with peer-review 
comments from Nature and Science and 
that the authors had made revisions on 
the basis of these, which facilitated the fast 
review. Another round of peer review was 
conducted in the two-day gap between 
submission and acceptance, said the 
spokesperson. 

Two days is “quite long”, says Rao. “You 
can e-mail the article to everyone at once. 
It’s not like the old days.” Daniel Cressey and 
David Cyranoski 


Lanphier, president of Sangamo BioSciences 
in Richmond, California, and a member of the 
group that wrote the Nature article’ calling for 
a moratorium. 

But George Church, a geneticist at Harvard 
Medical School in Boston, Massachusetts, 
disagrees that the technology is so imma- 
ture. He says that 


the researchers did “Some 

not use the most questions about 
up-to-date CRISPR/ early human 
Cas9 methods and development 
that many of the canonlybe 
researchers’ prob- @ddressed by 
lems could have been studying human 
avoided or lessened embryos.” 


if they had. 

Although researchers agree that a morato- 
rium on clinical applications is needed while 
the ethical and safety concerns of human- 
embryo editing are worked out, many see no 
problem with the type of research that Huang’s 
team did, in part because the embryos could 
not have led to a live birth. “It's no worse than 
what happens in IVF all the time, which is 
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that non-viable embryos are discarded,” says 
John Harris, a bioethicist at the University of 
Manchester, UK. “I don’t see any justifica- 
tion for a moratorium on research,” he adds. 
Church, meanwhile, notes that many of the 
earliest experiments with CRISPR/Cas9 were 
developed in human induced pluripotent stem 
cells, adult cells that have been reprogrammed 
to have the ability to turn into any cell type, 
including sperm and eggs. He questions 
whether Huang’s experiments are any more 
intrinsically problematic. 

Modifying human embryos is legal in China 
and in many US states. Asked whether Huang’s 
study would have been funded under its rules, 
the US National Institutes of Health says that it 
“would likely conclude it could not fund such 
research’, and is watching the technology to see 
whether its rules need to be modified. 

Because the embryos Huang’s team used 
were initially created for in vitro fertilization, 
not for research, the work would already have 
overcome many of the ethical hurdles it would 
face in other countries too, adds Tetsuya Ishii, 
who studies bioethics and policy at the Univer- 
sity of Hokkaido in Sapporo, Japan. 


NEXT STEPS 

Applying gene editing to human embryos 
could answer plenty of basic scientific ques- 
tions that have nothing to do with clinical 
applications, says George Daley, a stem-cell 
biologist at Harvard Medical School, who sup- 
ports editing of human embryos in vitro for 
research purposes. 

For instance, altering developmental genes 
with CRISPR/Cas9 could help to reveal their 
functions. “Some questions about early human 
development can only be addressed by study- 
ing human embryos,’ he says. 

Gene editing could also be used to engi- 
neer specific disease-related mutations in an 
embryo, which could then be used to produce 
embryonic stem cells that could act as models 
for testing drugs and other interventions for 
disease, says Daley. 

Huang now plans to work out how to 
decrease the number of off-target mutations 
using adult human cells or animal models. 

Still, researchers expect to see more gene- 
editing studies in human embryos. “The 
ubiquitous access to and simplicity of creat- 
ing CRISPRs,’ says Lanphier, whose company 
applies gene-editing techniques to adult 
human cells, “creates opportunities for scien- 
tists in any part of the world to do any kind of 
experiments they want.” He expects that more 
scientists will now start work on improving 
on the results of the Huang paper. A Chinese 
source familiar with developments in the field 
said that at least four groups in China are 
pursuing gene editing in human embryos. = 


1. Liang, P. et al. Protein Cell http://dx.doi. 
org/10.1007/s13238-015-0153-5 (2015). 

2. Lanphier, E. et al. Nature 519, 410-411 (2015). 

3. Baltimore, D. et al. Science 348, 36-38 (2015). 
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Canadian Prime Minister Stephen Harper (left) and finance minister Joe Oliver announced the budget. 


F pouitics | 


Canada pushes 
applied research 


Budget increases government partnerships with industry, 


but plays down basic science. 


BY MARGARET MUNRO 


ith six months to go before the 
next Canadian election, the reign- 
ing Conservative government has 


introduced a budget that emphasizes applied 
research and scientific collaboration with 
industry. 

The 518-page proposal, released on 


CHEMICAL WEAPONS 


MORE 
ONLINE 


21 April, will take effect in the coming weeks. 
It spells out how Prime Minister Stephen 
Harper’s government plans to balance its 
budget this year — at the same time pledging 
plenty of new spending in years ahead. 

The Canada Foundation for Inno- 
vation would receive Can$1.33 billion 
(US$1.09 billion) in new money for university 
and hospital research facilities, to be doled out 


IN FOCUS 


over six years beginning in 2017. The budget 
also includes a modest 2% hike for the country’s 
research-funding councils, much of it targeted 
for areas “that will fuel economic growth’. 

David Robinson, executive director of the 
Canadian Association of University Teachers 
in Ottawa, calls the plan a missed opportunity. 
“The government 
continues to miss the 
fact that real innova- 
tion and scientific 
advancements are 
driven by long-term 
basic research, not 
short-term market 
demands,” he says. 

Much of the promised funding for research 
is destined for industrial partnerships. This is 
in keeping with other Conservative policies, 
some of which have strained relationships 
between the government and scientists. 

The Harper government has been criti- 
cized for micromanaging how research 
money is spent and for how science is com- 
municated. “Muzzled” federal scientists are no 
longer free to discuss their research findings 
with the media without clearance from the 
government (see Nature 483, 6; 2012). And 
funding for basic-research and federal sci- 
ence programmes such as the Experimental 
Lakes Area, an acclaimed freshwater research 
centre in northwestern Ontario, has been cut 
as the Conservatives have boosted spending 
on applied research. 

“There's a trend to use universities as sur- 
rogates for industrial research, says Paul 
Dufour, a fellow at the Institute for Science, 
Society and Policy at the University of Ottawa, 
and a science-policy consultant. 

The trend continues in the latest budget. 
The proposal includes Can$119 million over 
the next two years for the National Research 
Council's “industry-partnered research and 
development activities”; Can$46 million per 
year, starting in 2016, to the research-granting 
councils in part to “support their growing 
interactions with the private sector”; and 
Can$56.4 million over four years, starting 
in 2016, for business-related internships for 
graduate students. 

Although the budget allocates just 
Can$10 million in new money this year for 
the government’s “world-class advanced 
research” initiatives — which include 
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> projects in physics, space technology 
and astronomy — it pledges to invest more 
than Can$200 million per year in new and 
expanded programmes starting in 2016. 

Dufour says that the research land- 
scape has become “a bit of a dog’s break- 
fast” with the Conservatives’ targeted 
research initiatives. And he notes that 
the focus on industrial research has not 
improved Canada’s overall research and 
development performance. 

The Organisation for Economic Co- 
operation and Development (OECD) 
reported in November that Canada has 
dropped out of the top ten research and 
development performers, ceding ground 
to more aggressive countries such as Tai- 
wan. The OECD also reports that Canada 
has seen one of the sharpest declines in the 
percentage of gross domestic product spent 
on research. 


ELECTION TOPIC 

Opposition parties hope to make science 
an issue in the October election, but their 
criticism might be blunted by initiatives 
such as Harper’s Can$243-million, ten- 
year commitment to the Thirty Meter 
Telescope (TMT), announced earlier this 
month and highlighted in the budget. The 
international observatory is to be con- 
structed on the summit of Mauna Kea in 
Hawaii. 

“You can rest assured the TMT will be 
trotted out during the campaign as evi- 
dence that they are investing in research 
that is probing the origins of the Universe,’ 
says Scott Findlay, a biologist at the Uni- 
versity of Ottawa and founding member 
of Evidence for Democracy, a group that 
pushes for more science-based decision- 
making in government. 

The budget also commits Canada to 
supporting the International Space Station 
until 2024, an extension that the United 
States and Russia have already committed to. 

As the election nears, the government 
is expected to start announcing projects 
financed by its Can$1.5-billion Canada 
First Research Excellence Fund, which 
aims to put Canadian universities “among 
the best in the world for talent and break- 
through discoveries”. Applicants must spell 
out the strategic relevance of proposed 
projects and their potential to “create long- 
term economic advantages for Canada”. 

Many of Canada’s university presidents 
enthusiastically endorsed the fund when 
Harper announced details in December. 
And some were quick to applaud the new 
budget’s provisions. “They will benefit 
Canada and Canadians now and for years 
to come,’ says David Barnard, president of 
the University of Manitoba and chair of the 
Association of Universities and Colleges of 
Canada. @ 
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Pope Francis says that humanity “has slapped nature in the face”. 


Vatican hosts 
climate meeting 


Religious leaders and scientists gather to discuss moral 
implications of global warming as Pope drafts key letter. 


BY EDWIN CARTLIDGE 


progressive teachings on sexuality, con- 

traception and abortion. Now he and 
the Catholic Church are tackling another 
contentious issue: climate change. 

This week, scientists, religious figures and 
policymakers gathered at the Vatican to dis- 
cuss the science of global warming and the 
danger posed to the world’s poorest people. 
The meeting came as Francis prepares an 
encyclical letter to bishops on climate change 
for release this summer, ahead of United 
Nations climate negotiations in December. 
The Pope’s strong feelings on the matter are 
apparent: in January he said that people were 
“mostly” responsible for recent warming and 
that they have “slapped nature in the face”. 

Two scholarly Vatican groups that advise 
the Pope on scientific issues organized the 
meeting. One, the Pontifical Academy of Sci- 
ences, has held several related conferences, 
including a 2011 meeting on melting glaciers 


Pp ope Francis has drawn attention for his 
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and another last year on sustainable devel- 
opment. But the climate gathering this week 
is the first at the Vatican to include religious 
leaders as well as scientists. 

Roughly 20-25 individuals representing 
religions such as Protestant and Orthodox 
Christianity, Judaism and Islam joined a 
similar number of scientists. Nobel Prize- 
winning chemist Paul Crutzen, climatologist 
James Hansen and former UK Royal Society 
president Martin Rees were scheduled to 
attend, along with UN secretary-general Ban 
Ki-moon and Italian President Sergio Mat- 
tarella. All participants were asked to sign a 
statement “on the moral and religious impera- 
tive of sustainable development”. 

Peter Raven, a botanist at the Missouri 
Botanical Garden in St Louis and an organizer 
of the meeting, says that such messages are 
crucial for winning public support for policies 
to reduce greenhouse-gas emissions. “The 
only way politicians can put binding agree- 
ments in place is if they are convinced that 
enough people care about the problem,” he 
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says. “And that will only happen if the prob- 
lem has a strong ethical component.” 

Among the moral arguments for action on 
climate change is the idea that society must 
preserve Earth for future generations, says 
another meeting organizer, climate scientist 
Veerabhadran Ramanathan of the Scripps 
Institution of Oceanography in La Jolla, Cali- 
fornia. A further argument is that the world’s 
3 billion poorest people produce a fraction 
of humanity’s annual greenhouse-gas emis- 
sions — about 5% — but are predicted to 
suffer disproportionately from the impacts 
of climate change, such as more extreme 
weather. 

Francis is not the first major religious 
leader — nor even the first pope — to speak 
out on climate change. In a 2009 statement, 
Archbishop of Canterbury Rowan Williams 
and other UK religious representatives rec- 
ognized a moral imperative to address the 
causes of global warming. And last month, 
the presiding bishop of the US Episcopal 
Church, Katharine Jefferts Schori, described 
those who attribute global warming to purely 
natural causes as “often driven by greed and 
self-centred political interests, and sometimes 
by wilful blindness”. 

Closer to home, Francis’s predecessor 
Benedict XVI made numerous statements 
on the environment, describing climate 
change in 2011 as a “worrying and complex 


phenomenon”. But Ramanathan thinks that 
the huge popularity of the current pope pre- 
sents a unique opportunity to raise awareness. 
“Pope Francis has become a Kennedy-like 
figure who goes beyond Catholicism and 
appeals to the whole world,” he says. 

Even a popular pope can stir contro- 
versy, however. Francis’s support for action 
on climate change 


has raised hackles “Ithink he will 
among some con- say that there 
servative Catholics, isaconnection 
such as Steve Moore, between how 
who is chiefecono- wetreatthe 
mist at the Heritage planet and how 
Foundation, a free- wetreat one 
market think tank gpyother.” 

in Washington DC. 

Francis has “allied himself with the far left”, 
Moore says. 


But Dan Misleh, executive director of the 
Catholic Climate Covenant, an environmen- 
tal group in Washington DC, says that Francis 
is unlikely to endorse specific policies to fight 
climate change. “I don't think he is going to 
say that a carbon tax is preferable to a cap- 
and-trade programme,’ Misleh says. “But I 
think he will say that there is a connection 
between how we treat the planet and how we 
treat one another.” 

Such a strategy will allow Francis to avoid 
seeming “ideological” while still speaking 
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clearly about the role of humans in climate 
change, says James Bretzke, a Catholic theo- 
logian at Boston College in Massachusetts. 
Already, he says, several Vatican figures who 
are likely to be involved in drafting the Pope's 
encyclical have made unequivocal statements 
on global warming. Last year, for example, 
Bishop Marcelo Sanchez Sorondo, chancel- 
lor of the Pontifical Academy of Sciences, said 
that “essentially all documents published now 
accept as a scientific truth that climate change 
is due to human activity”. 

The document set to be signed at this week’s 
meeting contains a very similar statement, 
according to co-organizer Partha Dasgupta, 
an economist at the University of Cambridge, 
UK. Attendees were scheduled to discuss the 
physical and chemical processes that underlie 
global warming, he says, but a debate on the 
idea that human activities drive that warming 
was not planned. “Religious leaders might ask 
whether other scientists have different views,’ 
says Dasgupta, adding that the small minor- 
ity of scientists who are sceptical of climate 
change were not represented at the meeting. 

Still, the Pope will be careful to point out 
where scientific uncertainties remain, Bretzke 
predicts. Such caution is borne of the Vatican's 
famous over-certainty in the past. “Centuries 
ago, the Church got on the wrong end of cos- 
mological arguments,” he says. “It has been 
chastened by that.” m 


MEDICAL RESEARCH 


Gene-testing firms set sights 
on drug development 


Companies race to gather large DNA data sets in bid to find treatments based on genetics. 


BY ERIKA CHECK HAYDEN 


( ompanies that offer genetic testing 
directly to consumers are renewing 
their ambitions. Recent moves by US 

regulators have given the firms fresh hope that 

the large genetic data sets they amass will have 
commercial as well as scientific value, spawn- 
ing diagnostic tests or drugs. 

The moment seems ripe. In February, the US 
Food and Drug Administration (FDA) allowed 
23andMe — a company in Mountain View, 
California, that has offered genome analysis to 
consumers for nearly a decade — to begin mar- 
keting a test for mutations that cause a rare dis- 
ease called Bloom syndrome. This was the first 
time that the agency had approved a genetic test 
marketed to the public, not clinicians — and 
companies are betting that it will not be the last. 


Many expect the FDA to further expand 
the types of medically relevant information 
that consumer genetic tests can supply. The 
February decision “is a huge step for the field’, 
says Ken Chahine, a senior vice-president at 
Ancestry.com DNA of Provo, Utah, which 
offers genetic testing to the public. “You're 
starting to see others get into this space 
because everyone sees a lot of value in it” 

Ancestry.com DNA, a subsidiary of 
genealogy website Ancestry.com, is one of 
those newcomers. The company has so far 
collected DNA from 850,000 customers, which 
it uses to help people to find relatives among 
members of its genealogy-focused social net- 
work. It is now exploring the idea of collecting 
users’ medical histories, too, so that it could 
offer them medically relevant information and 
investigate the genetic roots of diseases. 
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23andMe is farther down this road and has 
already attracted interest from health-care 
firms. In January, it signed a US$60-million 
deal with California biotechnology company 
Genentech, which will use genetic data from 
23andMe customers to develop therapeutics. 
The personal-genomics firm has also hired 
former Genentech executive Richard Scheller 
to lead an in-house effort to develop drugs. 
And, similar to Ancestry.com, 23andMe has 
access to a home-grown social network that 
encourages its customers to interact with the 
company and each other. 

Commercial firms are not the only group 
seeking to build large pools of participants 
for genetic studies — or the only ones har- 
nessing social networks for this cause. The 
medical-research study Genes for Good, 
which launched on 31 March, is seeking > 
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> participants through Facebook. Users 
of the social-networking service can down- 
load a Genes for Good app that guides 
them through a series of health surveys to 
collect basic information, such as height, 
weight and whether a person uses tobacco. 
Users who complete 15 of these surveys 
can receive a ‘spit kit’ to collect saliva for 
DNA sequencing, and are encouraged to 
continue providing health information 
afterwards. 

“We wanted to do something that would 
give us the possibility of capturing very 
large numbers of people in a way that is 
more engaging than the traditional types 
of study,” says Gongalo Abecasis, a statisti- 
cal geneticist at the University of Michigan 
in Ann Arbor who is leading the study. 


WIDENING THE NET 
Drawing on 23andMe’s example, the 
project will return genetic information 
to users — such as hints about ancestry 
drawn from a user’s genetic markers. The 
project also hopes to share the information 
it collects with other academic scientists, 
or even companies 


“They’re smart developing new 
to tether their therapies, after 
research toa removing identi- 
platform that fying information 
has amuch such as names and 
broader market addresses. 
base of people.” _So far, Genes for 
Good has recruited 


4,200 potential participants of the 20,000 it 
seeks, and hundreds of those have already 
sent back their spit kits. That is a relatively 
small number of genomes compared to the 
amount of genetic information contained 
in private databases. And it is dwarfed by 
the amount of data being amassed by public 
projects, such as one run by the UK Depart- 
ment of Health that aims to sequence the 
genomes of 100,000 patients. 

Such efforts, in theory, could compete for 
participants with commercial ventures such 
as Ancestry.com DNA and 23andMe, which 
ask customers to pay for testing similar to 
the type that Genes for Good will offer for 
free. It is not clear whether that will affect 
the companies’ ability to attract custom- 
ers and expand their data sets, given their 
significant head start. 

Genes for Good is tapping the power 
of Facebook, with its 1.44 billion active 
monthly users. Given this potential, the 
project could become more massive than 
any previous genetic study, says Michelle 
Meyer, a bioethicist and legal scholar at the 
Icahn School of Medicine at Mount Sinai 
in New York. “They’re smart to tether 
their research to a platform that has a 
much broader market base of people than 
we're used to seeing in research studies,” 
she adds. = 
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DNA from humans who lived in the Andes 9,000 years ago gives clues to how South America was peopled. 


ARCHAEOLOGY 


South America 
settled in one go 


Tests of human remains from a cave high in the Peruvian 
Andes point to a single early migration. 


BY EWEN CALLAWAY 


umans settled in South America in 
H a single wave of migration not long 

after their ancestors first crossed from 
Siberia into the Western Hemisphere during 
the last ice age, genetic evidence suggests. The 
finding, based on DNA from the remains of 
five ancient humans who lived high in the 
Peruvian Andes, also hints at how ancient 
Andeans evolved to thrive at altitudes of more 
than 4,000 metres. 

Presented on 16 April at the annual meeting 
of the Society for American Archaeology (SAA) 
in San Francisco, California, the research sheds 
light on the last major continental migration 
in human prehistory — and one of the least 
understood. 

South America’s oldest-known human 
occupation site — the 14,600-year-old Monte 
Verde settlement in Chile — suggests that 
people quickly reached the continent after 
crossing the Bering land bridge one or 
two thousand years earlier, perhaps skirt- 
ing the Pacific coast. But some argue for a 
second migration. They use skeletal evi- 
dence to propose that the long, narrow 
skulls of South Americans living more 
than about 5,000 years ago differ too much 
from the rounder heads of more-recent 
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inhabitants, and of living indigenous people, 
to represent one continuous population’. 

In the 1950s and 1960s, Peruvian archaeolo- 
gist Augusto Cardich discovered human 
remains bearing the characteristic ancient 
skull shape in a rock shelter perched high 
in the Andes in a region called Lauricocha. 
Carbon dating put the site at around 9,000 years 
old, making it a classic archaeological example 
of early high-altitude living. Later research- 
ers dated remains from Lauricocha closer to 
5,000 years old, greatly diminishing its appeal 
for researchers interested in the early prehistory 
of South America. “People simply forgot about 
this site. Nobody was interested any more,’ says 
Lars Fehren-Schmitz, a biological anthropolo- 
gist at the University of California, Santa Cruz. 

More than halfa century after the site’s dis- 
covery, Fehren-Schmitz’s team got permission 
to look at five human skeletons excavated from 
Lauricocha and held at the National Museum 
of the Archaeology, Anthropology and History 
of Peru in Lima. The team redated the remains, 
remeasured the skulls and extracted DNA. 

Their work, presented by Fehren-Schmitz 
at the SAA meeting, paints a complicated 
picture of Lauricocha. Two of its residents, 
a woman and a 2-year-old child, died nearly 
9,000 years ago. The third, a man, perished 
around 2,500 years later and another man died 
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about 2,300 years later still. The fifth specimen 
was not dated because ofits condition. Only the 
womans skull had a long, narrow shape, which 
is known as dolichocephaly. 

To gauge whether the Lauricocha residents 
were descended from members of more than 
one migration, the team sequenced the DNA 
from their mitochondria, cellular organelles 
that are passed directly from mother to child, 
and thus trace maternal ancestry. All five peo- 
ple were found to be descended from maternal 
lines that are common among modern and 
ancient indigenous people of North and South 
America. The males’ Y chromosomes put them 
on a lineage that arose in the region around 
the Bering Strait some 17,000 years ago — the 
most widely accepted time and place for the 
original human migration into the Americas. 
These and other DNA data suggested that all 
the Lauricocha humans are descended from the 
first humans to reach the Americas, supporting 
one migration into South America. 

Not everyone accepts the argument. Such a 
wide-ranging conclusion cannot be drawn from 
a handful of remains found at a single site, says 
Tom Dillehay, an archaeologist at Vanderbilt 
University in Nashville, Tennessee. He argues 
that the 5,000-year gap between the earliest 
inhabitants at Monte Verde, which Dillehay 
excavated, and the earliest Lauricocha skeletons 
leaves the door open for earlier migrations. 


“There may well be a single migration — they 
don't have the evidence for it?” he says. 

Ancient human DNA from other parts of 
South America, such as the Amazon, may yet 
point to a second migration into the continent, 
agrees Fehren-Schmitz. His team is now analys- 
ing full genomes from Lauricocha and other 
South American sites to obtain a fuller picture 
of the continent's past. 

Deborah Bolnick, a biological anthropologist 
at the University of Texas at Austin, accepts the 
argument for a single migration. But regardless 
of the number of waves, there would have been 
further population movements after that initial, 
southbound founding trek, she says, adding that 
analysis of full genomes from ancient South 
Americans will give a more nuanced picture of 
the continent's prehistory. Researchers studying 
ancient DNA have found evidence for a massive 
migration from present-day Russia and Ukraine 
into western Europe around 4,500 years ago’, 
and Bolnick says that similar upheavals are 
bound to have happened in the Americas. In 
a 2014 paper, for instance, Fehren-Schmitz 
documented a migration into the Central 
Andes about 1,400 years ago, possibly driven 
by drought in lower-lying areas’. 

Ancient South American genomes may also 
show how humans adapted to the New World. 
Ina separate study, Fehren-Schmitz and his col- 
leagues looked at a gene variant that protects 
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against altitude sickness. Between around 8,500 
and 600 years ago, the presence of the variant 
increased markedly among the Andeans. 

Fehren-Schmitz says that the finding is 
merely suggestive of local adaptation. But with 
full, ancient genomes in hand, researchers will 
be able to look more thoroughly for signs of 
adaptation to high altitude, says Bolnick. With 
that question in mind, her team is sequencing 
DNA from remains of mountain residents of 
Argentina. Ancient genomes from the Ameri- 
cas may also reveal how humans adapted to 
shifts in diet — early Americans domesticated 
maize (corn), potato and other crops — and to 
diseases imported from Europe, such as small- 
pox. South American prehistory, it seems, is 
about to get alot more complicated. m 
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CORRECTION 

The Editorial ‘Highway to health’ (Nature 
520, 407; 2015) stated that the African 
Centres for Disease Control and Prevention 
would need at least $300 million and 

60 staff members. This should have read 
$60 million and 300 staff. 
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COLLATERAL 


DAMAGE 


How acase of misconduct brought a leading 
Japanese biology institute to its knees. 


BY DAVID CYRANOSKI 


ironobu Fujiwara was already troubled as he made his way to work on 5 August 

2014. As a laboratory head at the RIKEN Center for Developmental Biology 

(CDB) in Kobe, Japan, Fujiwara had been enduring criticism ever since a case 

of misconduct had been exposed at the centre earlier that year. A media furore 
had escalated week by week, as newspapers, social media and television programmes 
all demanded an explanation for how scientists at a prestigious institute could have 
conducted shoddy work. Although he was not involved in the suspect studies, Fujiwara 
and many other employees at the centre felt under attack. 

And then, when he arrived that day, he heard terrible news. Yoshiki Sasai — a found- 
ing member of the CDB who had been implicated in the misconduct case — had hanged 
himself in an adjoining building that morning. “I was just in shock,’ Fujiwara says. 
“T didn’t know what to think, or whether to believe it was true” 

Revelations of scientific misconduct always cause collateral damage: they taint 
colleagues and co-authors of the person responsible, and can close down labs. But the 
case at the CDB triggered unusually strong and far-reaching aftershocks. 

The case involved two high-profile papers in Nature’ that described a surprisingly 
simple method to make pluripotent stem cells — cells with the prized ability to develop 
into any of the body’s cell types. After the misconduct came to light, a crucial report 
blamed not just the individuals involved, but also the entire centre — and recommended 
that it be dismantled. Since then, the CDB’s funding has been slashed, half of its labs 
have been closed, merged or assigned to other organizations, and its leadership has been 
replaced. The upheaval reaches far beyond the centre. A government science adminis- 
trative reform has been put on hold and the scientific community across Japan is now 
bracing for the impact of anti-misconduct policies introduced in the wake of the affair. 

To many scientists and journalists in Japan, this was an appropriate response. They 
thought that the stem-cell results had been sensationalized, and that it was fitting to 
take drastic action when those results were shown to be false. 

But other scientists in Japan and many abroad call it an overreaction and say that 
events over the past year reveal how responses to a crisis can create their own problems. 
The way that scientists and the media reacted exposed long-standing jealousies in Japan 
towards the 15 well-funded RIKEN centres and institutes, which had already made 
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the CDB a target for critics. The frenetic media 
and social-media response created a storm that 
bureaucrats and science-policy administra- 
tors were desperate to quell. And reporting by 
Nature calls into question whether the CDB 
was given enough ofa chance to defend itself: a 
committee behind the key report on the future 
of the CDB did not interview most of those 
involved to find the causes of the misconduct. 
Teruo Kishi, who led the committee, defends 
the process. “You seem to think that by talking 
to people involved we would find something 
out,” he says. “But we wouldn't have learned 
anything from asking them” 

Some scientists worry that the ferocity of 
the response will hamper research for current 
or former CDB scientists and even dampen 
Japan's ability to support innovative science in 
the long run. “It was only two papers, but from 
the beginning to the end the media kept blow- 
ing it up larger and larger; says former can- 
cer geneticist Yuko Ito, now a science-policy 
expert at the Japan Science and Technology 
Agency in Tokyo. “In the end, a lot of scientists 
became victims.” 


SCIENTIST’S PARADISE 

RIKEN was founded in 1917 as the Insti- 
tute of Physical and Chemical Research in 
Tokyo, and it expanded heavily into biologi- 
cal sciences with a series of institutes that 
opened from the late 1990s onwards. The 
centres became known in Japan as a scientist’s 
paradise, because researchers there had no 
undergraduate teaching responsibilities and 
enjoyed generous salaries and research funds 
that meant that they did not depend on grants. 
Even before the stem-cell crisis, “the relation- 
ship between RIKEN and universities was not 
really good’, says Hiroshi Hamada, a develop- 
mental biologist formerly at Osaka University, 
who took over as CDB director this year. 

Even among RIKEN centres, the CDB stood 
out. Launched in 2000, when Japan was trying 
to rejuvenate its research infrastructure, the 
institute abandoned the crusty hierarchical 
structure that encumbered university laborato- 
ries. Scientists and staffaddress each other with 
the common san rather than the sensei typically 
used for superiors. In a 2002 interview with 
Nature, Sasai vowed to “give young researchers 
a degree of independence” that was previously 
unknown in Japan (see Nature 415, 952-953; 
2002) — and soon young principal investi- 
gators, some still in their 20s, were mapping 
out their research programmes. A handful of 
established researchers, known as group direc- 
tors and including Sasai and founding director 
Masatoshi Takeichi, oversaw operations. 

The approach quickly paid off, and high- 
profile results emerged. Lab leader Mitinori 
Saitou won acclaim for engineering germ cells 
in a dish (see Nature 500, 392-394; 2013); Sasai 
built a reputation for his skill growing eye and 
brain structures in culture (see Nature 488, 
444-446; 2012). Ina country where research 
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has often struggled to make an international 
impact, the CDB developed a global reputa- 
tion for exceptional work in developmental 
biology. The institute was proud to state that 
of 163 papers that its researchers published in 
2013, one-third were in leading international 
journals including Nature, Science and Cell. 

The centre seemed to score another big coup 
in January 2014, when some of its scientists pub- 
lished the two papers on pluripotent stem cells. 
Developing the technology, called stimulus- 
triggered acquisition of pluripotency, or STAP, 
was the project of biochemist Haruko Obokata, 
who had started the work at Harvard Univer- 
sity in Cambridge, Massachusetts, and brought 
it to the CDB. She had worked with three highly 
respected scientists at the centre: mouse-cloning 
pioneer Teruhiko Wakayama, stem-cell biolo- 
gist Hitoshi Niwa and Sasai. 

When the technique was published, it 
caused excitement in Japan. Pluripotency had 
become almost a household word since Shinya 
Yamanaka at Kyoto University had won a Nobel 
prize in 2012 for his work on a related tech- 
nique to make induced pluripotent stem (iPS) 
cells. And the media made much of Obokata, a 
young, quirky woman who wore a kappogi —a 
type of traditional apron that she had received 
from her grandmother — instead ofa lab coat, 
and whose laboratory walls were painted pink 
and yellow. She was everything that the tradi- 
tional Japanese scientist was not. 

Within weeks of the papers’ publication, 
however, the work began to unravel. Science 
blogs pointed to manipulated figures and 
other scientists could not repeat the results 
(see go.nature.com/h9tr5w). A committee 
at RIKEN investigated and found signs of 


ae 


misconduct. RIKEN officials held marathon 
four-to-five-hour press conferences, formally 
accused Obokata of misconduct on 1 April, 
and recommended that the papers be retracted, 
which they later were. The committee’s report 
said that Sasai and Wakayama had not been 
involved in the misconduct, but shared “grave 
responsibility” for not catching the problematic 
data. Niwa was cleared of wrongdoing. 

At this point, the typical university fraud case 
in Japan — and there have been several high- 


“IN THE END, 
ALOT OF 
SCIENTISTS 
BECAME 
VICTIMS.” 


profile examples over the past 15 years — winds 
down. The researcher implicated in the miscon- 
duct usually resigns and the tainted papers are 
retracted. Witness, for example, the resigna- 
tion of University of Tokyo molecular biologist 
Shigeaki Kato and the retraction of dozens ofhis 
papers between 2012 and 2014. In that case, the 
damage was limited to members of Kato’ group. 

But with STAP, the media was hungry for 
more. News teams camped in the lobby of the 


Teruo Kishi (second left) of the RIKEN reform committee, at a 2014 press conference on the CDB’s future. 
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CDB and filled the corridors of the science min- 
istry, looking for twists to the story. They trailed 
Obokata, Sasai and many other CDB research- 
ers and administrators. (Japan’s national 
broadcaster, NHK, apologized after Obokata 
was injured in one scrape with reporters.) 

News articles, tweets and blogs started 
spreading the blame to the CDB and RIKEN as 
a whole. Critics said that the misconduct inves- 
tigation had been hastily prepared, and had 
failed to get to the bottom of the problem. They 
also focused on a patent that RIKEN and the 
scientists involved in the work had filed on the 
STAP technique — standard practice in aca- 
demia, but taken as evidence that the centre and 
its researchers were driven by economic incen- 
tives. Noriko Osumi, a developmental biologist 
at Tohoku University and one of the most vocal 
critics of RIKEN during the STAP crisis, faulted 
the organization for “inviting excessive atten- 
tion from the media’, which she blamed on “the 
influence of commercialization and industri- 
alization of research in the life sciences”. The 
desire for money and media attention was seen 
by others as a reason that senior scientists at the 
CDB had thrown their weight behind the STAP 
project and let their guard down, allowing poor 
science to take place. 

Young CDB investigators who had nothing 
to do with Obokata’s work were worried that 
their reputations were being tarnished. “At first 
the line was blurred, then we saw that there 
was no line at all,” says Yu-Chiun Wang, who 
had arrived at the CDB in October 2013 from 
Princeton University in New Jersey. “We were 
all seen as part of the same criminal complex.” 
CDB scientists, technical staff and secretaries 
were being slandered online, says Takeshi Imai, 
a neurobiologist at CDB. “Myself, my lab mem- 
bers and colleagues were also there,’ he says. 
Even researchers at other RIKEN centres felt 
under attack. 


QUIETING THE STORM 


The media hullaballoo was a headache for the 
science ministry, which was making plans to 
give certain research organizations unprec- 
edented autonomy. RIKEN was to be the first. 
The ministry “wanted to finalize the miscon- 
duct matter quickly’; says Maki Kawai, a former 
RIKEN executive director. It thought an aggres- 
sive action “would seem a more reasonable 
response to the taxpayers’ voice, which is the 
media”. 

On 9 April 2014, shortly after Obokata was 
charged with misconduct, RIKEN established 
an independent ‘reform committee chaired 
by Kishi, a lean 75-year-old materials scientist 
with a long list of top administrative positions. 
On 12 June, the Kishi committee released eight 
recommendations, including the promotion of 
research integrity, new fraud-prevention meas- 
ures and a more thorough investigation of the 
STAP papers. But among these, one recom- 
mendation leapt out: to “dismantle” the CDB. 
“The Kishi report threw me out of my chair,’ 
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says Ichiro Hiratani, a developmental biologist 
who had joined the CDB the previous year. 

The Kishi committee's main task was to sug- 
gest measures to prevent misconduct, but half 
ofits report analysed what had happened in the 
STAP misconduct episode, and why. The centre 
should be dismantled, the report said, because 
“the CDB, as an organization, had structural 
flaws, that induced or could not deter fraud”. 
One of the main problems was an ossification 
of leadership: since its founding, the centre had 
had basically the same group directors. These, 
the report said, had come unconsciously to rely 
on each other in a cosy relationship that pre- 
vented critical scrutiny of each other's decisions. 

Takeichi says that he and the CDB had been 
aware that a change was overdue well before 
the crisis. Over the preceding three years, the 
centre had searched for a foreign director to 
breathe new life into the CDB and increase its 
global status, but found it difficult to recruit a 
scientist of suitable calibre. However, Takeichi’s 
leadership had not been in question before the 
STAP crisis. In 2011, RIKEN’s advisory com- 
mittee opposed his retirement. 

The Kishi report said that poor governance 
at the CDB had led to mistakes. It accused the 
centre of circumventing normal procedures 
to help hire Obokata, and “surmises” a reason: 
“the CDB was strongly motivated by a desire 
to acquire a groundbreaking result” that would 
outdo Yamanaka’s discovery of iPS cells. The 
report “surmises” that Sasai “naturally was 
involved in STAP in anticipation of the huge 
budget” that it would bring to the CDB. The 
report also said that Obokata’s decision to wear 
a kappogi was part of a “showy PR strategy” 
orchestrated by Sasai. 

In response to the report, RIKEN quickly 
convened a committee to start working out 
how to implement the recommendations. 
“Given how harshly the media was treating 
RIKEN, it had no choice but just accept it,” a 
science-ministry official who does not want 
to be named told Nature. But Takeichi and 
Sasai contested many of the assertions in the 
report. Takeichi said that the hiring process for 
Obokata was normal; he and Sasai denied that 
there was a desire to outdo the iPS discovery, 
and other researchers at the CDB say that there 
was no pressure to do so. Part of Sasai’s job as 
CDB deputy director was to organize and seek 
funding — a job at which he was very success- 
ful — but in an e-mail to Nature’s news team 
in June 2014, he denied that he had become 
involved in STAP because it would attract 
money to the institute. Sasai, Takeichi and 
RIKEN’s public-relations office have all denied 
being involved in Obokata’s choice of clothes. 

In the interview with Nature in February, 
Kishi stood by his committee’s report and 
acknowledged that it went beyond fact. “There 
was a lot of speculation, guesses, in the report,’ 
he said. “But they were guesses made with con- 
fidence; on the basis of what he had seen in 
long, televised press conferences. 
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Kishi said that in preparing its report, the 
committee did not talk to Obokata, Sasai or 
anyone at the CDB aside from Takeichi — a 
fact that other members of the committee con- 
firmed. Kishi says that RIKEN would not let 
the committee meet with Sasai. But, Kishi said, 
“his televised conference made me feel that I 
didn’t need to hear any more than what he said. 
I felt he said enough, he cannot be changed.” 


"GETTING 
ALMOST NO 
SUPPORT FROM 
SCIENTISTS WAS 
SHOCKING AND 
DEPRESSING.” 


Then Kishi passed a surprisingly harsh judge- 
ment on Sasai. “Even if we had asked him, he 
would not have told us the truth,’ he said. This 
differs from the opinion of those who knew 
Sasai well, who say he was an honest, broad- 
minded person, devoted to scientific research. 

Committee member Masaki Nakamura, a 
research-integrity specialist and historian of 
science at Osaka University, also defended the 
report. He says that speculation was used in it 
the same way that a prosecutor will fill in the 
motivations of a defendant in a court of law. 
Four other members of the Kishi committee 
did not respond to requests for comment, or 
refused to comment. Nicholas Steneck, an 
expert in research integrity at the University of 
Michigan in Ann Arbor, says that in a miscon- 
duct case, he personally “would not be comfort- 
able speculating about anything that was not 
based on some facts and direct information” 
He says, however, that such speculation is sur- 
prisingly common among scientists. “History 
and reporting on current events should follow 
basically the same methods as science,’ he says. 

When the Kishi report came out, many for- 
eign scientists found its conclusions excessive 
and arbitrary, and more than 150 letters of sup- 
port were sent to the CDB. But in Japan, it was 
mostly received uncritically. Scientific organi- 
zations such as the Science Council of Japan 
threw their weight behind the judgement. The 
most dispiriting thing for many CDB research- 
ers was the indifference or criticism from their 
scientific colleagues — some of it a reflection 
of the resentment of the CDB and RIKEN that 
had built up over the years. “Newspaper and 
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media, fine, they are trying to sell newspapers,” 
says Hiratani. “But getting almost no support 
from scientists was shocking and depressing.” 

The fallout was especially hard on Sasai. Until 
June he had been “somehow getting by’, says 
Keiko Muguruma, a stem-cell biologist at the 
CDB and a frequent collaborator with Sasai — 
they had just submitted a paper for publication. 
“Tn science, he felt he could recover from the 
harm to his reputation. But for things that he 
had no control, like the dismantling and budget 
decrease, which would affect all the young 
researchers, he felt guilt and responsibility,’ says 
Muguruma. According to the lawyer working 
with Sasai’s family, the Kishi report and media 
attacks were factors in explaining Sasai’s suicide. 
Kishi and Nakamura respond that it is difficult 
to know why Sasai took his own life. 

In August, the CDB began making changes in 
line with the Kishi report. RIKEN announced a 
plan to introduce new fraud-prevention meas- 
ures and strengthen governance. 

Months later, in November, Takeichi stepped 
down and an interim director was put in place. 
The CDB saw 9 of its 40 laboratories shifted 
to other RIKEN centres, and another 11 were 
merged or closed. The centre also changed its 
Japanese name to “centre for research into the 
formation of multicellular systems’, although 
it kept its original English name. In December, 
after announcing that she had failed to repeat 
her experiments, Obokata resigned. 

The changes are still going on. On 1 April 
this year, Hamada took over as director and 
a 40% cut to the centre's budget kicked in. 
Researchers have been scrambling to supple- 
ment their finances with grants. 


ASSESSING IMPACT 


In interviews in the past two months, Kishi and 
Nakamura expressed tempered views of the 
CDB. Nakamura says that Takeichi’s leader- 
ship had been “extremely good in comparison 
to other universities and research institutes”. 
Although “we wrote about problems in their 
research ethics and education’, Nakamura says, 
the CDB “was relatively speaking really quite 
advanced”. 

The word dismantle (kaitai) was a strategic 
choice meant to please an angry press rather 
than to put an end to the CDB, say both Kishi 
and Nakamura. It was “an appeal to society, 
trying to show the idea that RIKEN was taking 
this problem seriously’, Nakamura says. Both 
say that they wanted to see the centre rebooted 
under new leadership and with a new name. 
Kishi says he wanted a “reborn CDB” — and 
in his view, the restructuring has met that aim. 
“Nothing’s really changed,’ says Kishi. 

But that is not how it looks to those on the 
ground. Hamada and other CDB researchers 
worry about whether they will be able to attract 
new principal investigators and postdoctoral 
researchers to the now-tarnished centre. It has 
lost two of its most prominent researchers in 
Sasai and Niwa, who is moving to Kumamoto 
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Yoshiki Sasai (right), co-author of the controversial stem-cell papers, bows after speaking at a press conference in April 2014. 


University this year. Fujiwara has stayed at the 
CDB, but says that his research has suffered. 
When he was supported by secure funding from 
the centre, he had planned a five-year project to 
map proteins in the extracellular matrix of hair 
follicles. Now that he must secure year-to-year 
grants, that project looks harder to realize. 

The impact of the STAP episode has also been 
felt further afield. Since August, policies for pre- 
venting and dealing with science misconduct 
have been released by the science ministry, the 
health ministry and other bodies. As part of this, 
data-management rules are being tightened to 
ensure that data are checked more often and are 
made available for verification. Such rules are 
frustrating scientists, and there are widespread 
concerns that research is becoming overly 
bureaucratic. “We hada pretty good system, but 
now we have to check this and that,’ says Kawai. 
“I don't want to grab time from scientists.” 

The new science-ministry guidelines make 
an institution responsible for fraud, and say 
that its funding can be cut if fraud-prevention 
measures are deemed inadequate. Imai says 
that this could be counterproductive, because 
it could push institutes and scientists to cover 
up suspected misconduct. 

STAP could have a more subtle but pervasive 
impact on science, says Ito. Historically, young 
researchers in Japan have been able to take an 
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experiment in a different direction from their 
supervisor's, and to follow their hunches down 
often productive paths. But the fear of fraud 
is likely to lead to stricter internal auditing, 
including the checking of lab notebooks, so 
“now they'll be afraid to. That will affect their 
ability to be innovative and they will be less 
motivated,” says Ito. “That will be the biggest 
impact going forward. It is ironic, she says, 
that the experience of the CDB, which was 
born out of the desire to encourage young 
scientists, could end up stifling them. 
Steneck says that it is useful to assess 
whether the STAP case was handled in the 
best way. He says that RIKEN responded well 
when the issue came to light: “They faced up 
to the problem quickly and brought in other 
opinions.” But he questions the make-up of 
the Kishi committee, which was dominated by 
scientists. He thinks that a misconduct inves- 
tigation should be led by experts in studying 
behaviour in scientific and research institu- 
tions. “Having a committee of non-experts 
gather their own evidence does not work” 
Paul Taylor, director of the Office for 
Research Ethics and Integrity at the University 
of Melbourne in Australia, says that a Japanese 
investigatory organization akin to the US Office 
of Research Integrity might have helped. “It 
provides an independent source of expertise in 
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conducting investigations, and perhaps ensures 
that trust in research is maintained,” he says. 
He adds that Japan's tightened requirements on 
data management and research-integrity train- 
ing might help to prevent scientific misconduct 
in the future — but there are limits. “If someone 
is able to justify to themselves why it is OK to 
fabricate an image, or steal someone's work, 
then they will do that even in the presence of 
the best research-integrity environment” 

Hamada is optimistic that the CDB can move 
on. He hopes to negotiate with the science 
ministry to restore researchers’ budgets within 
three years, and plans to continue the CDB’s 
mission of gambling on promising young scien- 
tists. “I have to reorganize, refresh,” he says. “My 
job is to forget about what happened” 

That will not be easy for Fujiwara. He was 
just starting to get good data — but is worried 
that he will have trouble getting funding for 
the mouse experiments that he needs to even 
publish his work so far. “It was going to be an 
important year for us,’ he says. Now, he just 
hopes that his science can survive. = 


David Cyranoski is a reporter for Nature 
based in Shanghai, China. 


1. Obokata, H. et al. Nature 505, 641-647 (2014). 
2. Obokata, H. et al. Nature 505, 676-680 (2014). 
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A 2013 protest against the Syrian government pays tribute to those who died in explosions at the University of Aleppo earlier that year. 


After the 


Four years after revolutions shook governments in North Africa 
and the Middle East, scientists face an uncertain future. 


sa young physicist in Tunisia, Imen 

Sfar gave little thought to politics 

until March 2010, when a street 

vendor set himself on fire to pro- 
test about corruption in the city of Monastir, 
where Sfar worked. 

Two months later, an influential blogger 
and close friend of Sfar’s was arrested and 
interrogated for helping to organize a dem- 
onstration opposing Internet censorship in 
Tunisia. His detention gave Sfar the “cour- 
age factor’, she says, to join the revolution 
that erupted in December 2010, after another 
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street vendor set himself aflame in Sidi Bouzid 
to protest against harassment by local officials. 
That act unleashed years of pent-up frustra- 
tions against the repressive government of 
President Zine al-Abidine Ben Ali, who had 
ruled the country for almost a quarter of a 
century (see Nature 469, 453-454; 2011). 
“Events unfolded very quickly, with people 
taking to social media and the streets,” recalls 
Sfar. “It was what I call the abolition of fear; we 
no longer were scared of anything” 

The uprising triggered a wave of unrest that 
swept across North Africa and the Arabian 
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Peninsula. Demonstrations and revolts 
destabilized long-standing regimes, ultimately 
toppling rulers in Tunisia, Libya, Egypt and 
Yemen. 

Nearly five years after the Arab Spring, the 
political situation in most of those countries 
remains volatile and many scientists are strug- 
gling more than they did before. This week, 
Nature profiles how researchers’ situations 
have changed in Tunisia, Egypt and Syria, 
which have followed starkly different politi- 
cal trajectories. 

In Tunisia, scientists are celebrating the 
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country’s successful transition to democracy, 
although the hard-won political freedoms 
have not yet translated into changes in the 
research system. In nearby Egypt, many of 
the freedoms won through the revolution 
have been reversed and the status of scientists 
has improved little. Syria has deteriorated the 
most, spiralling into an all-out civil war that 
has driven most scientists to flee the country. 
Yet even amid the chaos, some researchers 
have remained in Syria, where they struggle 
to teach students and carry on their research 
in any way possible. 

“T do not fear death, it will happen anywhere 
you go — but many students in Syria need our 
help,” says Ahmad Almansour, a materials 
engineer at the University of Aleppo. 


Before the Arab Spring, Tunisia had a long 
tradition of support for education and 
research, unlike many ofits neighbours. It pro- 
duced more papers, relative to the size of its 
population, than any other Arab nation apart 
from Saudi Arabia, and it was the only one 
to invest more than 1% of its gross domestic 
product (GDP) in research and development. 

But the repressive government kept tight 
control of university policies and stifled aca- 
demic freedom, especially in disciplines that 
delve into potentially controversial territory, 
such as the social sciences. Faouzia Charfi, a 
retired physicist and veteran opponent of the 
regime’s human-rights restrictions, tells how 
researchers needed permission for almost 
everything, even organizing academic con- 
ferences and collaborating with foreign peers. 

The regime feared people acting on their 
own initiative, says Charfi, who became junior 
minister for higher education in the first post- 
revolution transition government. Universities 
and researchers had little freedom to develop 
their own policies or strategies. And regime 
bureaucrats blocked attempts to build links 
between universities and industry, stifling the 
kind of innovation that builds economies and 
creates jobs, she says. 

The police state established by Ben Ali 
affected most aspects of everyday life. Few 
dared to criticize the government or even to 
discuss politics, and people could never be 
sure that neighbours or friends were not part 
of the regime’s web of informers, says Sfar. 
Instead, they took refuge in the safe subject of 
sport. “Before the revolution, Tunisians dis- 
cussed nothing but football,” she says. 

After Ben Ali fled Tunisia in January 2011, 
freedom of expression exploded overnight, 
says Sfar. But the euphoria soon dimmed as 
the country went through four years of suc- 
cessive governments and political turmoil, 
including the assassination of leading oppo- 
sition politicians. At times, many feared that 
the democratic gains of the revolution would 
be reversed. But Tunisia’s robust civil society 
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Physicist Imen Sfar is optimistic that newfound freedoms will translate into better research in Tunisia. 


kept the revolution on track, with huge street 
demonstrations and strikes. After the moderate 
Islamist party Ennahda reached a compromise 
with secular parties, Tunisia last year became 
the first and only Arab Spring country to 
successfully transition to democracy. 

In a historic moment, parliament over- 
whelmingly passed a constitution that guar- 
anteed free speech, freedom of expression, 
religious freedom and equality between the 
sexes. The constitution also explicitly protects 
scientific and academic freedom, and man- 
dates that the state “supply the means neces- 
sary for the development of scientific research 
and technology”. In October, a secularist 
party won the parliamentary elections, and 
in December its candidate Beji Caid Essebsi 
was elected as the nation’s president. 

The biggest change since the revolution, says 
Sfar, “is this kind of feeling of freedom to say 
what we think about politics and the admin- 
istration without fearing serious repercus- 
sions”. At the professional level, she says that 
the revolution has had little impact on her field 
of condensed-matter physics and she has been 
preoccupied with starting her career and a fam- 
ily. She has taken a position at the University of 
Tunis El Manar, and is contemplating switching 
to theoretical work on materials with unusual 
electrical and magnetic properties because she 
lacks access to the kind of equipment she used 
while completing her PhD in France. 
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More broadly, the dramatic changes in 
Tunisian society have yet to permeate the 
research and higher-education systems. The 
past quarter ofa century of authoritarian rule 
has left a system that is far from realizing its 
potential, says Hechmi Louzir, director of the 
Pasteur Institute in Tunis. The regime's tight 
grip on free thinking stifled creativity, he says. 

“People didn't stray from the beaten path,” 
says Louzir. Sfar says that even now, many of 
her students do not seem keen to think for 
themselves, and they expect lecturers “to show 
them one, and only one, road to the truth”. 

And Charfi is disappointed that research- 
ers have not yet taken advantage of their new 
freedoms to bring in sweeping reforms, attrib- 
uting the reticence to the highly conservative 
culture of the country’s scientific community. 

Educational reform and innovation in 
science are important, say researchers, 
because they will help Tunisia to create wealth 
and jobs — a priority given the country’s dis- 
mal economic state and high unemployment. 
And Tunisians are ever mindful of local 
threats; just last month radical Islamist mili- 
tants killed 22 people in a terrorist attack on 
the Bardo National Museum in Tunis. 

Still, Tunisian researchers point to some 
positive changes. In the past, political nepotism 
and influence often determined who held key 
university posts, but democratic elections now 
decide, they say. 
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As Sfar looks back over the changes, she sees 
grounds for optimism. “There is now freedom 
of expression and thought, and an extraordi- 
narily active civil society,” she says. 


EGYPT 


After a citizen uprising toppled the regime of 
President Hosni Mubarak in February 2011, 
microbiologist Ramy Aziz returned to Egypt 
full of hope for change. 

“I had wanted to come back to Egypt to 
bring the scientific experience I gained in 
bioinformatics back to Cairo University,’ says 
Aziz, who was at San Diego State University 
in California at the time. “The revolution was 
the spark that changed everything overnight.” 

Students, researchers and professors took 
part in the uprising and their expectations 
soared in the months that followed. The 
government promised to increase science 
funding and many expatriate researchers 
vowed to return home. 

Researchers called for reforming outdated 
policies — such as requirements that slowed 
down the transfer of samples across borders or 
limited interactions with industry. Such changes, 
they said, would help Egypt to target its most 
pressing problems: water and energy insecurity, 
poverty and mounting unemployment. 

But the high hopes of Aziz and others have 
yet to be fulfilled. The government approved a 
doubling in science funding in 2012, but there 
was no clear agenda on how to use those funds 
and the research ministry failed to spend more 
than 80% of its budget. “On the ground we did 
not feel any change in the budget really,” says 
Aziz, who is now at Cairo University. 

Then, on 3 July 2013, a popularly backed 
coup removed the first democratically elected 
president, Mohamed Morsi, who represented 
the Muslim Brotherhood. Several of the gains 
that researchers won after the uprising have 
since been reversed. In June last year, for 
example, newly elected president Abdel Fattah 
al-Sisi scrapped the law introduced after the 
uprising that allowed professors to choose 
their university’s leaders. The positions are 
now appointed by the president (see Nature 
511, 5; 2014). 

“From a scientific point of view, I can’t see 
any change in Egypt,’ says Aziz. He does feel 
that university students and postgraduates have 
become bolder and more willing to challenge 
their professors and ask for their rights — a 
positive outcome. But he does not expect that to 
make a big difference in Egypt's science. 

The biggest win for scientists is consti- 
tutional, says Alaa-Eldin Adris, who chairs 
the department of petroleum and energy 
engineering at the American University in 
Cairo. The new constitution, passed in Janu- 
ary 2014, states that 1% of GDP should be 
spent on scientific research, and that this can 
be increased up to international standards. 
“On paper this looks good,” Adris says. But 
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he cautions that the | Ramy Aziz had hoped 
constitutional change __ that the ousting of 

is only the first step. President Mubarak 
“Tt is worthless unless | would make science 
itcreateslawsthatare easier to do in Egypt. 
enforced.” 


According to the World Bank, science 
spending in Egypt rose from 0.24% of GDP 
in 2009 before the uprising to 0.43% in 2011. 
But most of the increase goes to salaries rather 
than to funding research. In the Global Com- 
petitiveness Report 2014-2015, produced by 
the World Economic Forum, Egypt ranks 
among the worst ten countries in the world in 
the quality of its scientific research institutes. 

“We have many researchers, which is good, 
but when it comes to using science to solve 
problems or drive economic growth, we are 
almost at zero,” says Adris. 

Aziz says that working life in Egypt today is 
a continuous battle in a stilted system, where 
he lacks access to core facilities and an envi- 
ronment that supports research. In Egypt, 
he focuses on teaching and mentoring post- 
graduates, but he returns to the United States 
for a few months a year to conduct most of his 
experiments. When he works there, primers 
for DNA synthesis arrive a day after he orders 
them. “In Egypt, you get primers in three 
weeks. If they don’t work well, you will need 
another three weeks,” he says. 

Aziz thinks that the current political leader- 
ship is sincere in its goals to support science 
but has done little to actually help researchers. 
Others, however, accuse the regime of taking 
a strong stand against freedom of expres- 
sion within universities. Students have been 
arrested during protests on the campuses 
of Cairo’s Al- Azhar University and at Cairo 
University and security forces have frequently 
stormed the university gates, leading to violent 
confrontations with protesters. 

Many young people who might otherwise 
have helped to build up Egyptian science are 
being driven out. Islam Kotb left after he lost 
many friends in a violent dispersal of pro- 
testers in August 2013. “Leaving the country 
became my ultimate aim at that time and I 
accepted the first job offer I got,” he says. 
Now working as a critical-care specialist in 
Jeddah, Saudi Arabia, he hopes to return to 
Egypt one day, but stresses “that scientific 
research can never succeed and flourish in 
such an environment”. 

For Aziz and Adris, the real threats to 
research in Egypt are the bureaucracy (buying 
chemicals can take several months, for exam- 
ple) and the government’ failure to engage in 
serious reform, such as overhauling education 
in ailing public schools and universities. None 
of these problems has been addressed by any 
of the successive ruling regimes, they say. 

“T personally expect that by 2020 there will 
be little change, if any, in scientific research 
and output,” says Aziz. 

Yet even with all these problems, he still 
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finds hope. “I see excellent students trying to 
learn against insurmountable odds,’ he says. 
“You need to create a new generation with a 
different mindset and proper education that 
will eventually replace the current people in 
control, with their outdated vision” 


SYRIA 


In January 2013, students were gathering 
for their mid-year examinations at the Uni- 
versity of Aleppo when a series of rockets hit 
residence halls and other buildings. Smoke 
billowed across the campus and people ran in 
panic to find shelter. The attack killed at least 
82 people, including students and refugees 
who had sought protection at the university. 
The rebels and the government blamed each 
other for the bombing. 

Materials engineer Almansour was lucky 
that day. The blast blew out the windows 
of the building where he was working, 
300 metres from the centre of the explo- 
sion, but he and his family were unharmed. 
Although most Syrian scientists have fled 
the country to escape the ongoing civil war, 
Almansour feels an obligation to his students 


SABRY KHALED 


and is staying put. “If we all leave the country, 
who is going to teach them?” he asks. 
Whereas Tunisia and Egypt have started to 
stabilize politically, the situation in Syria has 
grown steadily worse. What started as an upris- 
ing against lack of freedom and an oppressive 
autocratic regime has turned into a civil war 
with many factions and little hope of resolution. 
Throughout the four years of turmoil, 
however, some education and research have 
continued. When Michel Rahal graduated in 
2013 from Damascus University with a bach- 
elor’s in applied chemistry, he had hoped to 
pursue a postgraduate degree in Europe. But 
the government eliminated almost all schol- 
arships for overseas education, and in any 
case, Rahal felt a responsibility to stay in his 
country during these challenging times. He 


is now studying for a master’s degree at the 
Higher Institute of Applied Science and Tech- 
nology in Damascus, although he feels that 
his options are limited while in Syria. 

Some of those who have left say that it was 
impossible for them to remain in the country. 
Ahmad Salman, who asked to have his real 
name changed for fear of retribution against 
his family still in Syria, is one of those who 
tried to stay as long as possible. “Despite the 
difficulties of life in Syria during the war with 
the lack of security, access to water or fuel and 
the huge inflation, I lived with the hope that 
the revolution would end in favour of the peo- 
ple,” he says. But when the regime passed a 
law drafting all men into the army, he left his 
position at Al-Baath University and fled to 
Turkey, where he is now jobless. “I only had 
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two choices, either join the military and be 
on the front lines to kill or be killed by my 
countrymen, or get out of Syria.” 

Most international researchers in Syria 
have also fled the violence, and that has 
impaired research in the country. The Inter- 
national Center for Agricultural Research in 
the Dry Areas (ICARDA), one of the biggest 
research centres in Syria, struggled for almost 
two years to keep its doors open before leav- 
ing the country for neighbouring Lebanon 
in late 2012. Mahmoud Solh, a geneticist and 
director-general of ICARDA, says that loot- 
ers had repeatedly attacked the facilities and 
stolen vehicles, computers and other equip- 
ment. But the centre managed to save its agri- 
cultural gene bank, one of the most important 
such collections, before it relocated. 

Researchers who remain in Syria say that 
the working conditions there are extremely 
difficult. “Students in the medical and engi- 
neering school need equipment and materials 
that they cannot get,’ says Zeina Al-Ahmad, 
a finance specialist at Tishreen University in 
Latakia. “We cannot participate in conferences 
as we used to either, which has also affected 
the quality of research in Syria.” The high 
inflation rate, which peaked at 121% in August 
2013, has hit everyone hard, making it nearly 
impossible for scientists to fund their research 
and publishing activities. 

And security remains a major concern for 
students and faculty members, who must 
often pass through dangerous war zones to 
reach their universities. Almansour lives close 
to his university's campus, but many students 
have had to drop out for fear of their safety. 
“The number of students attending classes is 
about 35-40% of the registered number,” he 
says. “Iam most worried about the possible 
destruction of the infrastructure. It took usa 
long time to build campuses, but if the fight- 
ing spares universities and schools then we 
will be able to go on in spite of the terrible 
circumstances.” 

As bad as conditions are in Aleppo and 
other areas controlled by the government, 
the situation can be even worse for those 
elsewhere. The extremist group ISIS has 
taken control of large areas of eastern Syria, 
where it has closed schools and universities 
and is forcing teachers to work in its newly 
founded schools or lose their lives, impelling 
many professors and teachers to flee. There 
are few signs that things are likely to improve, 
with ISIS and other groups now involved in 
the civil war. 

Given all the problems, Rahal plans to 
move to Europe as soon as he can. Many other 
researchers are also leaving, says al- Ahmad. 
“The young generation is trying to finish 
university and find a job abroad.” m 


Mohammed Yahia is chief editor of Nature 
Middle East. Declan Butler is a senior 
reporter for Nature in France. 
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Time for one-person trials 


Precision medicine requires a different type of clinical trial that focuses on 
individual, not average, responses to therapy, says Nicholas J. Schork. 


medications that will not help them. 

The top ten highest-grossing drugs 
in the United States help between 1 in 25 
and 1 in 4 of the people who take them (see 
‘Imprecision medicine’). For some drugs, 
such as statins — routinely used to lower 
cholesterol — as few as 1 in 50 may benefit’. 
There are even drugs that are harmful to 
certain ethnic groups because of the bias 
towards white Western participants in 
classical clinical trials”. 

Recognition that physicians need to take 
individual variability into account is driving 
huge interest in ‘precision’ medicine. In Janu- 
ary, US President Barack Obama announced .a 


B= day, millions of people are taking 


US$215-million national Precision Medicine 
Initiative. This includes, among other things, 
the establishment ofa national database of the 
genetic and other data of one million people 
in the United States. 

Classical clinical trials harvest a handful 
of measurements from thousands of people. 
Precision medicine requires different ways 
of testing interventions. Researchers need 
to probe the myriad factors — genetic and 
environmental, among others — that shape 
a person's response to a particular treatment. 

Studies that focus on a single person — 
known as N-of-1 trials — will be a crucial 
part of the mix. Physicians have long done 
these in an ad hoc way. For instance, a doctor 
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may prescribe one drug for hypertension and 
monitor its effect on a person's blood pres- 
sure before trying a different one. But few 
clinicians or researchers have formalized this 
approach into well-designed trials — usually 
just a handful of measurements are taken, and 
only during treatment. 

If enough data are collected over a 
sufficiently long time, and appropriate control 
interventions are used, the trial participant 
can be confidently identified as a responder 
or non-responder to a treatment. Aggregated 
results of many N-of-1 trials (all carried out in 
the same way) will offer information about 
how to better treat subsets of the population 
or even the population at large. > 
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> Formalizing and scaling up the N-of-1 
approach means solving various practical 
problems. These include exploiting the diver- 
sity of health-monitoring devices, developing 
new ones and identifying appropriate disease 
biomarkers, such as tumour DNA circulat- 
ing in the bloodstream. It will also require a 
cultural shift on many levels — in regulatory 
agencies, in pharmaceutical companies and, 
most ofall, in the clinic. 


A WORLD OF DIFFERENCE 

Discovering that an intervention works well 
in certain groups happens relatively rarely 
and often by chance. Researchers typically 
get disappointing results with a drug in large, 
population-based trials. This leads them to 
conduct ad hoc post-trial analyses, to try to 
identify the factors that cause some of the 
people in the trial to seem to be responsive’. 

For instance, the drug Gleevec (imatinib) 
was found to double survival rates of leu- 
kaemia patients* with a chromosomal 
abnormality in their tumours called the 
Philadelphia translocation. Similarly, it turns 
out that Erbitux (cetuximab) improves the 
survival of people with colorectal cancer 
whose tumour cells carry a mutated EGFR 
gene but not a mutated KRAS gene’. 

This approach to discovery is inefficient 
at best. Conventional phase III trials involve 
thousands of people. The intervention being 
tested is often given at random to one group 
while another group receives a sham treat- 
ment, such as a sugar pill or the standard 
treatment that physicians would give such 
patients. Because scant data are collected on 
factors such as genetics, lifestyles and diets, 
the results of these trials often indicate the 
need for yet another study to validate the 
effectiveness of the intervention among the 
apparent responders and to establish the 
underlying mechanisms. 

Various trial designs have been developed 
that better account for variability between 
patients. Basket trials, which have mainly 
been used for cancer, test the effective- 
ness of an intervention on the basis of its 
mode of action, regardless of what disease 
it was designed to treat. For instance, the US 
National Cancer Institutes MATCH Trial, 
expected to launch in May, will use genetic 
markers from tumours to assign 1,000 people 
who have different types of late-stage cancer 
to different treatment ‘baskets’ Baskets will 
include around 30 patients, all of whom share 
certain genetic anomalies; participants will be 
given one of around 25 drugs. The effective- 
ness of the different matches of treatment to 
genetics will be compared as the trial unfolds. 

In umbrella trials, researchers test the effec- 
tiveness of multiple drugs in a study of a sin- 
gle disease. This approach is being used in the 
ongoing Stand Up To Cancer and Melanoma 
Research Alliance Dream Team clinical trial, 
in which I am involved, run by Yale University 
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IMPRECISION MEDICINE 


For every person they do help (blue), the ten highest-grossing drugs in the 
United States fail to improve the conditions of between 3 and 24 people (red). 


1. ABILIFY (aripiprazole) 


Schizophrenia 
fe f 


3. HUMIRA (adalimumab) 
Arthritis 


4 
tt 


5, CYMBALTA (duloxetine) 


Depression 


8. REMICADE (infliximab) 


Crohn’s disease 
fe f 
fi ‘ 


Based on published number needed to treat (NNT) figures. For a full list of re 


2. NEXIUM (esomeprazole) 
Heartburn 


4. CRESTOR (rosuvastatin) 
High cholesterol 


6. ADVAIR DISKUS (fluticas 
Asthma 


Multiple sclerosis 


in New Haven, Connecticut, and the Transla- 
tional Genomics Research Institute in Phoe- 
nix, Arizona. We are giving genomically 
guided treatment to around 70 people out of 
100 with late-stage melanoma — choosing 
from an array of 40 or so drugs (the rest of the 
participants are receiving the standard treat- 
ment usually given to such patients). Here, the 
effectiveness of matches between drugs and 
genetics will be compared with standard care. 
Lastly, adaptive trials aim to match interven- 
tions to patients while the study is ongoing, 
on the basis of patient responses. 

Even these trial designs may not be person- 
alized enough. Among people who share, say, 
a particular mutation known to be targeted 
by a specific drug, many other factors can 
contribute to any one person's responsive- 
ness. This is particularly true for those with 
cancer. The drug vemurafenib, for instance, 
was approved in the United States to treat late- 
stage melanoma in people whose tumours 
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9. COPAXONE (glatiramer acetate) 


one propionate) 


7. ENBREL (etanercept) 


Psoriasis 
e e® @ 
e @ 


10. NEULASTA (pegfilgrastim) 


Neutropenia 


ferences, see Supplementary Information at go.nature.com/4dr78f. 


carry the BRAF(V600E) mutation. But some 
tumour cells develop other anomalies that 
make them resistant to the drug. Thus clini- 
cians considering whether to prescribe vemu- 
rafenib may need to take into account a whole 
slew of biomarkers. 


COMPARING TREATMENTS 

In N-of-1 trials, all sorts of relevant data 
will need to be collected for one person, 
as frequently as possible — perhaps every 
day or periodically over months or years. 
The usual design and statistical safeguards 
could be employed, such as blinding patients 
and experimenters to the drugs being tested, 
and the use of control interventions (such as 
periods of standard care). In addition, appro- 
priate crossover designs, in which different 
interventions are administered to the same 
person alternately (possibly with ‘wash-out’ 
periods in between to allow the drugs effects 
to wear off) would enable experimenters to 


compare the effect of different treatments in 
the same person. 

If done properly, claims about a person’s 
response to an intervention could be just 
as well supported by a statistical analysis as 
by analyses designed to assess population- 
level responses on the basis of classical 
clinical trials®. 

An example of this approach is a study’ 
in Australia, which measured reported pain 
levels, swelling and other symptoms associ- 
ated with osteoarthritis and chronic pain in 
132 people taking different drugs over three 
years. For each person, measurements were 
taken every 2 weeks for 12-week periods, 
when the patient was either off or on a par- 
ticular drug. By comparing the data collected 
before and after the different treatments, the 
researchers showed that, although initially 
costly, the formalized N-of-1 trials resulted 
in more-effective prescriptions. 

Sometimes N-of-1 trials will be neither 
appropriate nor feasible. For instance, the 
costs are probably too high for public-health 
studies that investigate the effect of a pop- 
ulation-wide intervention, such as adding 
fluoride to drinking water. Making objective 
claims about individual responses requires 
taking appropriate measures (of tumour 
progression, say) repeatedly and efficiently. 
Yet it is not always clear what to measure. 
Only a fraction of the thousands of proposed 
biomarkers have been shown to be useful in 
the clinic. 

But in many instances, an N-of-1 
approach is ideal. Such studies are already 
being done for some rare diseases by neces- 
sity, but often without the use of sophisti- 
cated trial designs and without necessarily 
collecting the appropriate information to 
make hypotheses about the drug’s mecha- 
nism. Many experimental drugs are also 
administered in ‘compassionate use’ settings. 
And many widely used drugs are provided 
to combat diseases for which they were not 
approved (‘off label’ prescription), for peo- 
ple who fail to respond to all other treat- 
ments. Examples include uses of the drug 
mexiletine to treat the rare muscle disease 
non-dystrophic myotonia, and experimental 
treatments for the Ebola virus. 

Well-designed N-of-1 trials could also 
be useful in the early stages of clinical drug 
development or in repurposing drugs — 
for exploring the molecular and physi- 
ological effects of anew compound (or of 
an old compound in a new context). Like- 
wise, studies investigating the safety and 
appropriate dosages of drugs could take an 
N-of-1 approach. Currently, phase I and II 
clinical trials usually involve giving different 
amounts of a drug to a small group of healthy 
volunteers. Better would be to tailor dosages 
to individuals’ metabolic profiles. 

N-of-1 trials could be designed to guide 
clinicians in detecting disease onset. For 


instance, US physicians generally view levels 
of a blood protein called CA125 greater than 
30 or 35 as an indication of ovarian cancer. 
However, a level of 20 or 25 may be a cause 
for concern if the person’s average CA125 
levels hovered around 10 or 15 over the pre- 
vious year*. Establishing personal thresholds 
for uncovering disease onset is the goal of the 
registered clinical trial known as the Tanner 
Project (www.tannerproject.org), in which 
Iam involved. 

By looking for commonalities across 
multiple N-of-1 studies — in which the 
same types of data 


are collected using “A key 

the same procedures Component 

— researchers should willbe 

beable todrawinfer- transforming 
ences about the everyday 
effectiveness of an clinical care 
intervention incer- infosolid 


tain subsets of the 
population, such asin 
people sharing particular genetic features, as 
well as in the whole population. 

Various teams are developing and testing 
algorithms to match interventions, or a com- 
bination thereof, to individuals on the basis 
of their genetic make-up, biochemistry, diet 
and other factors. For instance, matching 
drugs to tumour profiles is a key goal of the 
Stand Up To Cancer umbrella trial. 


N-of-1 trials.” 


MAKING IT HAPPEN 
There are significant barriers to mak- 
ing N-of-1 trials commonplace. Regula- 
tory agencies, researchers and clinicians 
are rightfully wary of moving away from 
classical clinical trials. Pharmaceutical 
companies tend to focus on drugs that are 
likely to be used by thousands or millions of 
people. What is more, tailoring treatments 
to patients is costly. For example, the can- 
cer-care company Foundation Medicine in 
Cambridge, Massachusetts, charges patients 
between US$5,000 and $7,500 to sequence 
their tumours and to use the results to advise 
on treatments. And there is a lot of work to 
be done on biomarkers, monitoring devices, 
study designs and data-analysis methods. 
A key component will be transforming 
everyday clinical care into solid N-of- 1 trials. 
In my view, the time is ripe for three reasons. 
First, there is a growing interest in ‘omics’ 
assays that expose people's unique character- 
istics at the molecular level. Researchers and 
clinicians are assaying people’s blood metab- 
olites (their metabolome) and the microbes 
in their bodies (their microbiome) as well 
as their DNA and RNA’. Second, cheap 
and efficient devices that collect health data 
are becoming available, such as the Apple 
Watch, continuous glucose monitors and 
portable electroencephalogram (EEG) mon- 
itors. Lastly, governments and life-sciences 
funding bodies worldwide are increasingly 
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supporting a more targeted approach as well 
as patient engagement in medicine, such as 
through the US Patient-Centered Outcomes 
Research Institute, established in 2010. 

I am confident that, ultimately, govern- 
ments, regulatory agencies and pharmaceu- 
tical companies will support sophisticated, 
well-designed N-of- 1 trials. These could save 
the millions of US dollars that are spent on 
inappropriate interventions, the manage- 
ment and treatment of persistent or recur- 
ring diseases, and on conventional phase II 
trials (which can cost between $100 million 
and $700 million per drug). Regulatory 
agencies such as the US Food and Drug 
Administration are beginning to recognize 
the importance of individual responses”. 
And sufficient financial or market incentives 
provided by governments could persuade 
pharmaceutical companies to broaden their 
focus away from ‘blockbuster’ drugs — espe- 
cially given the poor rates of return on drug 
discovery in recent years. 

Key to making precision medicine main- 
stream is the ongoing shift in the relationship 
between patients and physicians. A major 
advantage of the N-of-1 approach over clas- 
sical trials is that patients are no longer guinea 
pigs, whose involvement in a study may help 
only future generations. In N-of-1 trials, the 
effectiveness of different treatments are vetted 
for the actual participants. Indeed, members 
of hundreds of patient-advocacy groups, for 
instance for rare genetic diseases, are eager to 
be involved in studies to test candidate drugs. 

Physicians are having to become more 
acutely aware of the unique circumstance of 
each patient — something most people have 
long called for. m 
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COMMENT 


P-values are just the tip 
of the iceberg 


Ridding science of shoddy statistics will require scrutiny of every step, 
not merely the last one, say Jeffrey T. Leek and Roger D. Peng. 


here is no statistic more maligned 

than the Pvalue. Hundreds of papers 

and blogposts have been written 
about what some statisticians deride as ‘null 
hypothesis significance testing’ (NHST; see, 
for example, go.nature.com/pfvgqe). NHST 
deems whether the results of a data analysis 
are important on the basis of whether a 
summary statistic (such as a P value) has 
crossed a threshold. Given the discourse, it 
is no surprise that some hailed as a victory 
the banning of NHST methods (and all of 
statistical inference) in the journal Basic 
and Applied Social Psychology in February’. 

Such a ban will in fact have scant effect 
on the quality of published science. There 
are many stages to the design and analysis 
of a successful study (see “Data pipeline’). 
The last of these steps is the calculation of 
an inferential statistic such as a P value, and 
the application ofa ‘decision rule’ to it (for 
example, P<0.05). In practice, decisions 
that are made earlier in data analysis have 
a much greater impact on results — from 
experimental design to batch effects, lack 
of adjustment for confounding factors, or 
simple measurement error. Arbitrary levels 
of statistical significance can be achieved by 
changing the ways in which data are cleaned, 
summarized or modelled’. 

P values are an easy target: being widely 
used, they are widely abused. But, in prac- 
tice, deregulating statistical significance 
opens the door to even more ways to game 
statistics — intentionally or unintentionally 
— to get a result. Replacing P values with 
Bayes factors or another statistic is ultimately 
about choosing a different trade-off of true 
positives and false positives. Arguing about 
the P value is like focusing on a single mis- 
spelling, rather than on the faulty logic ofa 
sentence. 

Better education is a start. Just as anyone 
who does DNA sequencing or remote- 
sensing has to be trained to use a machine, 
so too anyone who analyses data must be 
trained in the relevant software and con- 
cepts. Even investigators who supervise data 
analysis should be required by their funding 
agencies and institutions to complete train- 
ing in understanding the outputs and poten- 
tial problems with an analysis. 

There are online courses specifically 
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DATA PIPELINE 


The design and analysis of a successful study 
has many stages, all of which need policing. 


Inference 


i 
Little debate 
| 5 


Summary statistics 


Statistical modelling 


Potential statistical models 


Exploratory data analysis 


Tidy data 


i 


Data cleaning 


Raw data 


Data collection 


Experimental design 


designed to address this crisis. For 
example, the Data Science Specialization, 
offered by Johns Hopkins University in 
Baltimore, Maryland, and Data Carpen- 
try, can easily be integrated into training 
and research. It is increasingly possible to 
learn to use the computing tools relevant 
to specific disciplines — training in Bio- 
conductor, Galaxy and Python is included 
in Johns Hopkins’ Genomic Data Science 
Specialization, for instance. 

But education is not enough. Data 
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analysis is taught through an apprenticeship 
model, and different disciplines develop 
their own analysis subcultures. Decisions 
are based on cultural conventions in spe- 
cific communities rather than on empirical 
evidence. For example, economists call data 
measured over time ‘panel data, to which 
they frequently apply mixed-effects models. 
Biomedical scientists refer to the same type 
of data structure as ‘longitudinal data, and 
often go at it with generalized estimating 
equations. 

Statistical research largely focuses on 
mathematical statistics, to the exclusion of 
the behaviour and processes involved in 
data analysis. To solve this deeper problem, 
we must study how people perform data 
analysis in the real world. What sets them up 
for success, and what for failure? Controlled 
experiments have been done in visualiza- 
tion’ and risk interpretation’ to evaluate 
how humans perceive and interact with data 
and statistics. More recently, we and others 
have been studying the entire analysis pipe- 
line. We found, for example, that recently 
trained data analysts do not know how to 
infer P values from plots of data’, but they 
can learn to do so with practice. 

The ultimate goal is evidence-based data 
analysis®. This is analogous to evidence- 
based medicine, in which physicians are 
encouraged to use only treatments for which 
efficacy has been proved in controlled trials. 
Statisticians and the people they teach and 
collaborate with need to stop arguing about 
P values, and prevent the rest of the iceberg 
from sinking science. = 
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CLIMATE ECONOMICS 


PLLAANALA 
FLL LL LS A 


The high road 


Michael Grubb is both swept away and frustrated by 
Nicholas Stern’s argument for tackling climate change. 


} ] icholas Stern’s Why Are We Waiting? 
is an important but frustrating 
book. It is important because it 

embodies the thinking of probably the most 

influential person at the nexus of climate 
change, economics and international policy. 

It is frustrating because it does not answer 

the titular question. It presents compelling 

economic logic for urgent action on climate 
change, but fails to get to the heart of what 
stands in the way. 

Stern, who was chief economist at the 
World Bank in the early 2000s, produced the 
hugely influential 2006 Stern Review on the 
Economics of Climate Change while head of 
the UK Government Economic Service. A 
crucial achievement of the report was Stern’s 
framework, which bridged 15 years of appar- 
ent impasses between the dominant scientific 
and economic perspectives. It offered a cost- 
benefit structure that agreed with the scien- 
tists, that strong action was needed to counter 
anthropogenic global warming. It ignited 
debate on climate change throughout the 
economics community, which until then had 
considered the subject of marginal interest. 
Nine years on, Stern concludes in Why Are 
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We Waiting? that the 
case for action is now 


Why 
even more urgent. aged 
The book is aston- .. We 
etl 


ishingly broad in scope 
and vision. Its core 
argument is that the 
next two decades will Nicholas Seen, 
be decisive, as a critical Vy 
period for emissions Why Are We 
coincides with mas-  Waiting?: The 
sive structural change Logic, Urgency, 
in the world economy. ane Eros iesot 
sec Tackling Climate 

These are conjoined Change 
challenges, and Stern — \\\CHOLAS STERN 
has long argued that MIT Press: 2015. 
“high-carbon growth 
is likely to self-destruct”. How the economic 
changes are channelled will determine the 
choice “between peril and prosperity”. 

Stern's coverage of the science is taut and 
displays his eye for relevant dimensions of 
the big picture, linked to human realities. He 
writes, for instance, that unchecked green- 
house-gas emissions are likely to cause carbon 
dioxide concentrations to reach 750 parts per 
million of the atmosphere by 2100, resulting 
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in a median temperature increase of 4°C or 
more above preindustrial levels, which would 
transform Earth's “physical and human geog- 
raphy”. So “deserts, coastlines, rivers, rainfall 
patterns — the reasons we live where we do 
— would be redrawn”. Homo sapiens, he says, 
“has not experienced anything like this” Stern 
notes that the uneven distribution of tempera- 
ture increases could “correspond to average 
land temperatures as much as 4—10°C above 
the preindustrial’, and could trigger frequent 
summer heatwaves as severe as the Russian 
disaster of 2010, which killed 55,000 people. 
Stern endorses the goal of checking emis- 
sions by enough to allow a 50:50 chance of 
staying belowa 2°C increase, but the window 
for that is closing rapidly. Far from being a 
burden justified by the risks, however, he sees 
the endeavour as “very attractive”. The cen- 
trepiece of his argument — drawing heavily 
on last year’s New Climate Economy report, 
which he co-chaired — is the potential for a 
tsunami of innovation to engulf the fossil-fuel 
economy, galvanizing economic growth. 
However, Stern does not adequately delve 
into the fact that too much analysis, modelling 
and policy is still dominated by the idea that 
there is a natural ‘least-cost’ energy system, 
and that market forces would deliver it if we 
only corrected market failures. This reflects 
the inbuilt assumptions of neoclassical eco- 
nomics. But the broad sweep of economic 
processes and thought spans at least two other 
domains. One concerns the behavioural and 
organizational realities that underpin the 
huge potential to improve energy efficiency, 
and other ‘easy wins. The other is evolution- 
ary economics: understanding how complex 
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systems evolve and can be locked into tra- 
jectories by incumbent industries, unless 
strong pressures force a change in course. 
These strands of economics are crucial to 
understanding the energy—climate nexus 
and policies for transformation. The econom- 
ics mainstream is still largely in thrall to the 
idea that competitive markets drive innova- 
tion, but liberalization of the energy sector 
destroyed UK energy research and develop- 
ment. The energy literature explains why, but 
Stern hardly touches on this and thus misses 
achance to help to educate fellow economists. 
Stern’s real speciality is displayed most in 
his discussion of the ethics of how to weigh 
costs and benefits over time, as well as broader 
issues in moral and political philosophy. The 
first is technical, but is at the heart of the post- 
Stern review debate: prevalent economic tools 
‘discount almost anything more than a few 
decades ahead. Critics often cite philosopher 
David Hume to support arguments that we 
should weight impacts on future generations 
on the basis of ‘revealed preference’ in mar- 
kets. To these critics, Stern cites Hume on the 
need for “governors and rulers” to overcome 
the “impatience” of individuals. Stern maps 
out his case with a precision reminiscent of 
philosopher and mathematician Bertrand 
Russell. He demolishes economists who, try- 
ing to avoid explicit ethical debates, forget the 
ethical assumptions of their discipline. 
Towards the end, Stern distils his thoughts 
on the international climate scene. Although 
he overlooks some key respects in which a 
binding treaty — the Kyoto Protocol — was 
crucial in forming European climate policy, 


huge value remains in his wisdom and his 
experience of international debates. 
However, Stern’s coverage of lessons from 
economic history and theory is messier. It 
meanders through market failures, policy 
uncertainties, the European Union's emis- 
sions-trading system, the debate over reserves 
of fossil fuels that must be left unburnt to sat- 
isfy emissions targets, shale gas, the German 
nuclear phase-out, failed Chinese dams and 
more. There are trenchant points about six 
market failures and the six areas of policy 
required to address them, but little structure 
behind the tales, and no compelling narrative 
on how policy failures might have been over- 
come. Instead, Stern embarks on a demoli- 
tion of his critics. The integrated assessment 
models beloved of many technical analysts 
come in for particular ire. Stern views them 
as “simplistic attempts to shoehorn the deep 
and dynamic issues into inappropriate or 
narrow models’, noting that many “can be 
profoundly misleading’, and evoking John 
Maynard Keynes’ old punchline: “It is better 
to be roughly right than precisely wrong” 
Most seriously, Stern has misdiagnosed the 
major obstacles to climate action as a failure of 
policy-makers to understand the scale of the 
risks, to grasp that delay is dangerous or to 
appreciate the feasibility of and potential for 
low-carbon paths. These are not the root of 
the problem, and Stern never gets to it, despite 
providing much of the intellectual material. 
Most policy-makers accept the need for 
action. Ignorance, inattention and ideology 
have all played a part in the waiting game, but 
it also springs from microrealities of politics 
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and economics. Stern is a big thinker, used 
to the broad sweeps of economic develop- 
ment and global issues. But the crux is local 
people and businesses. Voters might like the 
idea of clean energy, but oppose wind farms 
next door; back emission reductions and pro- 
fess support for market-based solutions, but 
oppose increased energy prices. There is little 
on energy prices in Why Are We Waiting?, but 
climate policy in Brussels and Washington 
DC is concerned with little else. 

The real challenge in controlling climate 
change is many-layered, but must include 
attempts to align global need for transforma- 
tion with the old adage that all politics is local. 
A brief reference to border measures, to deter 
industries from migrating to escape carbon 
controls, signals a shift in Stern’s thinking; but 
the argument is not developed further. 

Stern deserves much credit for moving 
the economics debate on so far in a decade. 
But Why Are We Waiting? is a depressing 
reminder of the sheer size of the elephant. 
The complexities of climate change are 
beyond the ken of any individual, even such 
a big thinker as Stern. Developing the insti- 
tutions, intellectual frameworks and impact 
channels required for effective ‘solution sci- 
ence, matched to the sheer range and scale 
of the problems and opportunities, remains 
very much a workin progress. m 


Michael Grubb is professor of international 
energy policy and climate change at University 
College London, editor-in-chief of Climate 
Policy, and author of Planetary Economics. 
e-mail: m.grubb@ucl.ac.uk 


We are the 98% 


Nathaniel Comfort unpicks the metaphors in a trio of 
books exploring the ‘junk’-ridden genome. 


r | Vhe language of DNA is a veritable 

cornucopia of metaphor and cliché. 

Since James Watson and Francis 
Crick solved the double helix, biologists have 
imagined DNA as an information-storage 
device: magnetic tape, a computer program 
or, most commonly, a book that contains the 
instructions for making a cell’s proteins. In 
multicellular organisms, this precious tome 
is secured in the vault of the nucleus, the 
membrane of which isolates and protects 
nature from nurture. 

But if a genome is text, it is badly edited. 
Most DNA is gibberish, full of stutters, snip- 
pets of doggerel from other species, and ech- 
oes of quiescent viruses. In humans, only 


about 2% of the genome encodes proteins. 
Much — but not all — of the remaining 
98% is evolutionary detritus. In the 1960s, 
researchers learned that non-coding DNA 
can serve vital functions, such as regulating 
gene action and building ribosomes. The 
remainder they began to call junk. 

Today, junk DNA isat the heart of the most 
radical transformation of how we understand 
the genome since the information metaphor. 
Three books — The Deeper Genome by John 
Parrington, Junk DNA by Nessa Carey and 
Biocode by Dawn Field and Neil Davies — 
present a vision of the twenty-first-century 
genome. Their relative success hinges on 
metaphor and imagery, both in how they 
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Biocode: The New Age of Genomics 
DAWN FIELD AND NEIL DAVIES 
Oxford Univ. Press: 2015. 


The Deeper Genome: Why There Is More to 
the Human Genome Than Meets the Eye 
JOHN PARRINGTON 

Oxford Univ. Press: 2015. 


Junk DNA: A Journey Through the Dark 
Matter of the Genome 

NESSA CAREY 
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conceive the genome and in the writing itself. 

In September 2012, the ENCODE (Ency- 
clopedia of DNA Elements) consortium 
announced that its multi-year international 
effort to catalogue the various types of DNA 
sequence had assigned “biochemical func- 
tion” to 80% of the human genome. Incau- 
tious reporters began shouting that junk 
was bunk, even though scientific consen- 
sus maintains that most genomes contain 
large amounts of it. The subsequent debate 
upregulated public interest in non-coding 
DNA — but how do we talk about DNA now? 

The title Biocode forestalls any doubt > 
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systems evolve and can be locked into tra- 
jectories by incumbent industries, unless 
strong pressures force a change in course. 
These strands of economics are crucial to 
understanding the energy—climate nexus 
and policies for transformation. The econom- 
ics mainstream is still largely in thrall to the 
idea that competitive markets drive innova- 
tion, but liberalization of the energy sector 
destroyed UK energy research and develop- 
ment. The energy literature explains why, but 
Stern hardly touches on this and thus misses 
achance to help to educate fellow economists. 
Stern’s real speciality is displayed most in 
his discussion of the ethics of how to weigh 
costs and benefits over time, as well as broader 
issues in moral and political philosophy. The 
first is technical, but is at the heart of the post- 
Stern review debate: prevalent economic tools 
‘discount almost anything more than a few 
decades ahead. Critics often cite philosopher 
David Hume to support arguments that we 
should weight impacts on future generations 
on the basis of ‘revealed preference’ in mar- 
kets. To these critics, Stern cites Hume on the 
need for “governors and rulers” to overcome 
the “impatience” of individuals. Stern maps 
out his case with a precision reminiscent of 
philosopher and mathematician Bertrand 
Russell. He demolishes economists who, try- 
ing to avoid explicit ethical debates, forget the 
ethical assumptions of their discipline. 
Towards the end, Stern distils his thoughts 
on the international climate scene. Although 
he overlooks some key respects in which a 
binding treaty — the Kyoto Protocol — was 
crucial in forming European climate policy, 


huge value remains in his wisdom and his 
experience of international debates. 
However, Stern’s coverage of lessons from 
economic history and theory is messier. It 
meanders through market failures, policy 
uncertainties, the European Union's emis- 
sions-trading system, the debate over reserves 
of fossil fuels that must be left unburnt to sat- 
isfy emissions targets, shale gas, the German 
nuclear phase-out, failed Chinese dams and 
more. There are trenchant points about six 
market failures and the six areas of policy 
required to address them, but little structure 
behind the tales, and no compelling narrative 
on how policy failures might have been over- 
come. Instead, Stern embarks on a demoli- 
tion of his critics. The integrated assessment 
models beloved of many technical analysts 
come in for particular ire. Stern views them 
as “simplistic attempts to shoehorn the deep 
and dynamic issues into inappropriate or 
narrow models’, noting that many “can be 
profoundly misleading’, and evoking John 
Maynard Keynes’ old punchline: “It is better 
to be roughly right than precisely wrong” 
Most seriously, Stern has misdiagnosed the 
major obstacles to climate action as a failure of 
policy-makers to understand the scale of the 
risks, to grasp that delay is dangerous or to 
appreciate the feasibility of and potential for 
low-carbon paths. These are not the root of 
the problem, and Stern never gets to it, despite 
providing much of the intellectual material. 
Most policy-makers accept the need for 
action. Ignorance, inattention and ideology 
have all played a part in the waiting game, but 
it also springs from microrealities of politics 


SPRING BOOKS | COMMENT | 


and economics. Stern is a big thinker, used 
to the broad sweeps of economic develop- 
ment and global issues. But the crux is local 
people and businesses. Voters might like the 
idea of clean energy, but oppose wind farms 
next door; back emission reductions and pro- 
fess support for market-based solutions, but 
oppose increased energy prices. There is little 
on energy prices in Why Are We Waiting?, but 
climate policy in Brussels and Washington 
DC is concerned with little else. 

The real challenge in controlling climate 
change is many-layered, but must include 
attempts to align global need for transforma- 
tion with the old adage that all politics is local. 
A brief reference to border measures, to deter 
industries from migrating to escape carbon 
controls, signals a shift in Stern’s thinking; but 
the argument is not developed further. 

Stern deserves much credit for moving 
the economics debate on so far in a decade. 
But Why Are We Waiting? is a depressing 
reminder of the sheer size of the elephant. 
The complexities of climate change are 
beyond the ken of any individual, even such 
a big thinker as Stern. Developing the insti- 
tutions, intellectual frameworks and impact 
channels required for effective ‘solution sci- 
ence, matched to the sheer range and scale 
of the problems and opportunities, remains 
very much a workin progress. m 


Michael Grubb is professor of international 
energy policy and climate change at University 
College London, editor-in-chief of Climate 
Policy, and author of Planetary Economics. 
e-mail: m.grubb@ucl.ac.uk 
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Since James Watson and Francis 
Crick solved the double helix, biologists have 
imagined DNA as an information-storage 
device: magnetic tape, a computer program 
or, most commonly, a book that contains the 
instructions for making a cell’s proteins. In 
multicellular organisms, this precious tome 
is secured in the vault of the nucleus, the 
membrane of which isolates and protects 
nature from nurture. 

But if a genome is text, it is badly edited. 
Most DNA is gibberish, full of stutters, snip- 
pets of doggerel from other species, and ech- 
oes of quiescent viruses. In humans, only 


about 2% of the genome encodes proteins. 
Much — but not all — of the remaining 
98% is evolutionary detritus. In the 1960s, 
researchers learned that non-coding DNA 
can serve vital functions, such as regulating 
gene action and building ribosomes. The 
remainder they began to call junk. 

Today, junk DNA isat the heart of the most 
radical transformation of how we understand 
the genome since the information metaphor. 
Three books — The Deeper Genome by John 
Parrington, Junk DNA by Nessa Carey and 
Biocode by Dawn Field and Neil Davies — 
present a vision of the twenty-first-century 
genome. Their relative success hinges on 
metaphor and imagery, both in how they 
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conceive the genome and in the writing itself. 

In September 2012, the ENCODE (Ency- 
clopedia of DNA Elements) consortium 
announced that its multi-year international 
effort to catalogue the various types of DNA 
sequence had assigned “biochemical func- 
tion” to 80% of the human genome. Incau- 
tious reporters began shouting that junk 
was bunk, even though scientific consen- 
sus maintains that most genomes contain 
large amounts of it. The subsequent debate 
upregulated public interest in non-coding 
DNA — but how do we talk about DNA now? 

The title Biocode forestalls any doubt > 
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> that the authors hew to the information 
metaphor. Breathlessly, Field and Davies 
survey the greatest hits and promises of 
genomics, including Jurassic Park-style reani- 
mation of extinct species, the microbiome and 
environmental engineering. The thin chap- 
ters blurt out strings of recent findings, each 
capped with a crescendo of sensational specu- 
lations that mostly rehearse familiar ethical 
questions. Critical distance is achieved with 
the time-honoured double negative: “Might 
a lawyer one day argue that deliberately not 
giving [our children] the best genes available 
is a form of abuse? It is not inconceivable to 
imagine a world where natural reproduction 
would seem primitive and even barbaric.’ It 
concludes by exhorting us to set our sights on 
a global genome project to understand “the 
software that shapes our living planet”. The 
biocode is Gaia plus DNA. But two clichés do 
not make a right. Biocode simply extends the 
text metaphor to the macrocosm. 

The old metaphor is not wrong; it is incom- 
plete. In the new genome, lines of static code 
have become a three-dimensional tangle of 
vital string, constantly folding and rearrang- 
ing itself, responsive to outside input. The 
roots of this idea run deep. In her 1983 Nobel 
lecture, geneticist Barbara McClintock called 
the genome a “sensitive organ of the cell”. 
McClintock, who discovered mobile genetic 
elements in the 1940s, had named them con- 
trolling elements because she thought they 
composed the regulatory system that gov- 
erned gene action. In 1980, Ford Doolittle 
and Carmen Sapienza proposed that transpo- 
sons were molecular parasites, jumping into 
genomes to propagate themselves. Parasitic 
transposons are now textbook knowledge, but 
McClintock's larger point holds: the genome 
is dynamic, full of regulatory elements that 
respond to environmental cues. 

The Deeper Genome is the only book of 
the three that credits McClintock as a pro- 
genitor of the three-dimensional genome. A 
scientist and journalist, Parrington covered 
the ENCODE story for The Times in 2012; 
his book enriches those accounts with his- 
torical and scientific context. The science is 
better than the history. He provides a fine 
discussion of recent support for McClintock’s 
often-overlooked late work on how stress can 
activate transposition, but he perpetuates the 
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myth that at first no one thought transposi- 
tion was real. The contested point was actu- 
ally McClintock’s interpretation of mobile 
elements as controllers of gene action. 
Parrington’s strongest chapters survey the 
emerging view of gene regulation, includ- 
ing DNA folding, epigenetics and regulatory 
RNA. Overall, this is a faithful, engaging por- 
trait of the twenty-first-century genome. 
Finally, Junk DNA, like the genome, is 
crammed with repetitious elements and 
superfluous text. Bite-sized chapters parade 
gee-whizz moments of genomics. Carey’s The 
Epigenetics Revolution (Columbia University 
Press, 2012) offered lucid science writing 
and vivid imagery. Here the metaphors have 
been deregulated: they metastasize through 
an otherwise knowledgeable survey of non- 
coding DNA. At one point, the reader must 
runa gauntlet of baseball bats, iron discs, Vel- 
cro and “pretty fabric flowers” to understand 
“what happens when women make eggs”. The 
genome seems to provoke overheated prose, 
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unbridled speculation and Panglossian opti- 
mism. Junk DNA producesa lot of DNA junk. 

The idea that the many functions of non- 
coding DNA make the concept of junk DNA 
obsolete oversells a body of research that is 
exciting enough. ENCODE'’s claim of 80% 
functionality strikes many in the genome 
community as better marketing than science. 

Still, as with McClintock, the larger point 
holds: the genome is more than a set of rules 
and parts descriptions. Finding apt imagery 
to replace the dead metaphor of the ‘instruc- 
tion book of life’ could enable us to break free 
of the cliché of nature versus nurture. It could 
usher in a more democratic conception of life, 
in which all the world’s a cell, and all the genes 
and genomes merely players. m 


Nathaniel Comfort is professor of the 
history of medicine at Johns Hopkins 
University in Baltimore, Maryland. He is 
working on a biography of DNA. 

Twitter: @nccomfort 


Neanderthal Man: In Search of Lost Genomes 


Pioneer of ancient-DNA studies Svante Paabo was inspired in his youth by ancient 
Egyptian history. Feeling that this field moved too slowly, he decided to study medicine 
instead, and went on to sequence the first full Neanderthal genome in 2010. Here 

he details the technicalities of his life’s work and the incremental discoveries, such as 
genetic intimations that modern humans and Neanderthals had mixed, which generated 
our theories of human evolution. (See Henry Gee’s review: Nature 506, 30-31; 2014.) 


ORIGINS OF LIFE 


An improbable journey 


Adrian Woolfson enjoys two studies on microbial life’s trek towards complexity. 


scientist Antoni van Leeuwenhoek sub- 

mitted an essay to the Royal Society of 
London detailing a singular discovery. This 
was the world of unicellular organisms, 
which he observed using a self-designed 
microscope. Three hundred years later, 
Leeuwenhoek’s “animalcules” were shown 
to hold the secret to the evolution of complex 
life on Earth. 

In his imaginative and beautifully written 
The Vital Question, evolutionary biochem- 
ist Nick Lane defines a genealogy that links 
the descendants of the Cambrian explosion 
— the first appearance of morphologically 
complex animals in the fossil record, 
about 540 million years ago — to the 
simple organisms that preceded them. 

In so doing, he persuades us that com- 
prehending the structure, function, 
behaviour, genetics and evolution of 
microorganisms is necessary for a 
deep understanding of complex life, 

and of the processes that undermine it, 
including diseases and ageing. This vis- 
ceral insight into the largely uncharted 
expanses of microbial existence could 

also form the basis of a predictive science 
enabling us to speculate about the nature of 
potential life on other planets. 

Biophysicist Paul Falkowski’s entertain- 
ing, easy-to-read and historically rich Life’ 
Engines, meanwhile, uses the work of micro- 
biologist Carl Woese to trace complex life 
back to its three lines of descent: bacteria, 
archaea and eukaryotes. By studying the 
RNA sequences of ribosomes — the cellular 
machines that make proteins — Woese was 
able to show that Charles Darwin was cor- 
rect in suggesting that all life arose from a 
single, now-extinct, common ancestor. 

It remains unclear how and when life first 
originated on Earth, but we know that the 
first unicellular organism emerged between 
3.6 billion and 2.7 billion years ago, giving 


le 1676, the Dutch merchant and amateur 


rise to bacteria and archaea, which have no 
nucleus or other sub-cellular organelles. 
The evolutionary engine of life then seems 
to have got stuck, idling along at the unicel- 
lular level for another 2 billion to 3 billion 
years. Falkowski explains how unicellu- 
lar organisms, although morphologically 
challenged, managed to perfect the basic 
biochemical ‘engines’ that would power all 
forms of life on Earth. According to Lane, 
the stagnation occurred because the molec- 


ular motors that drive the biochemistry of 


bacteria and archaea were unable to cross 
the energetic threshold necessary for the 
evolution of complex form. This energetic 


THE EVOLUTIONARY ENGINE 


GOT STUCK, 


IDLING ALONG AT THE 


UNICELLULAR LEVEL. 


constraint on life is the central focus of The 
Vital Question. 

It derives, Lane explains, from two princi- 
pal design features that all living things use 


to power themselves. The first is the use of 


high-energy molecules of ATP, the chemi- 
cal currency of energy transfer. The second 
is the idiosyncratic ‘chemiosmotic’ force, 
which moves protons and facilitates the 
continuous generation of ATP. Both Lane 
and Falkowski describe these molecular 
processes compellingly. Although adequate 
to power single bacteria-sized cells, the 
method constrains the allowable surface- 
to-volume ratio of a living cell. Lane argues, 
however, that around 1.5 billion years ago 
this energetic constraint was overcome by 
an improbable endosymbiosis event: an 


The Vital Question: Why is Life the Way it is? 
NICK LANE 
Profile: 2015. 


Life’s Engines: How Microbes Made Earth 
Habitable 

PAUL G. FALKOWSKI 

Princeton Univ. Press: 2015. 


ancestral archaean host engulfed a small 
population of symbiotic bacteria, resulting 
in the first eukaryotic cell, the forebear of 
complex life. 

Lane recounts how over time, the engulfed 
bacteria jettisoned most of their genes that 
were unrelated to energy production; these 

were either lost permanently or relo- 
cated to the cell nucleus. There they 
continued to fulfil their original func- 
tions, or formed the raw material for 
the evolution of new genes with unex- 
pected roles, such as transcription fac- 
tors — proteins that bind to DNA. This 
allowed embryonic stem cells to be 
patterned in three-dimensional space. 
What remained of the imbibed bacte- 
ria, with their pared-down genomes 
and surrounding membranes, became 
energy-generating mitochondria. The acqui- 
sition of these organelles enabled eukaryotic 
cells to expand their volume by up to 15,000 
times that of the average bacterium, and to 
support a genome around 5,000 times larger. 
Lane’s important realization is that this also 
gifted eukaryotic cells with about 200,000 
times more energy per gene than the aver- 
age prokaryotic cell. This over-cranking 
of the evolutionary engine allowed for the 
development of a baroque diversity in the 
nature and extent of cellular gene and pro- 
tein expression. 

Although readily accommodated by 
classic Darwinian evolutionary theory, the 
horizontal, sudden and co-operative nature 
of Lane’s evolutionary narrative differs from 
the incremental, vertical and competitive > 


The Homing Instinct: Meaning and Mystery in 


HOMIN Animal Migration 
INST] Nee Bernd Heinrich MARINER 2015 
spa, Erudite naturalist Bernd Heinrich attributes the 
A steratag instinct for migration to an affinity for ‘home’, from 


beavers’ skilful dam-building to the joyful dance of 
Alaskan cranes returned to their nesting pond. (See 
Joel Greenberg’s review: Nature 508, 317; 2014.) 


The Accidental Species: Misunderstandings of 
Human Evolution 

Henry Gee UNIV. CHICAGO UNIVERSITY PRESS 2015 
Nature’s palaeontology editor, Henry Gee, 
condemns the idea that our species is the 
pinnacle of evolution, arguing that traits prized as 
uniquely human, such as creativity, are not. (See 
Tim Radford’s review: Nature 503, 34-35; 2013.) 
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> features of the more canonical one. As 
elegant as the details underpinning the 
thesis seem, it is occasionally hard to 
distinguish between fact and specula- 
tion. Lane has, nevertheless, made a bold 
and commendable attempt to sketch out 
a highly challenging scientific issue for 
a general audience. In so doing, he has 
reaffirmed the importance of a largely 
overlooked area of basic research, and 
has generated testable hypotheses about 
the origins of complex life. 

Falkowski covers some of the same 
details of the evolution of microbes and 
their contribution to complexity, includ- 
ing the historical origins of the concept 
of endosymbiosis. However, his focus 
is primarily on how microorganisms 
have made Earth habitable, perhaps 
most notably with the development of 
oxygen-generating photosynthesis by 
cyanobacteria. This leads him to touch 
on humanity’s potential to undermine 
Earth's systems. 

History has shown how modifications 
to microbial biochemistry affect global 
geophysical processes. For example, fol- 
lowing 200 million to 300 million years of 
photosynthesis by ancient microorgan- 
isms, oxygen became a significant com- 
ponent of Earth’s atmosphere, increasing 
ozone levels, reducing the greenhouse 
effects of gases such as methane and lead- 
ing to one of the most extensive glacia- 
tions in the planet’s history. Humanity’s 
interference with natural biological pro- 
cesses risks damaging Earth in ways that 
cannot be predicted. 

What is clear is that a deep understand- 
ing of how complex life originated will 
provide insights into human biology 
and the nature of disease processes. It 
may also enable the generation of forms 
of life unconstrained by the contingent 
processes that locked life into its current 
trajectory. Life as we know it may eventu- 
ally be supplanted, perhaps one day even 
being viewed as a primordial soup that 
facilitated the emergence of silicon-based 
existence. m 


Adrian Woolfson is the author of Life 
Without Genes. 
e-mail: adrianwoolfson@yahoo.com 
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This Changes Everything: Capitalism vs the Climate 
Naomi Klein ALLEN LANE 2015 

Unafraid to name and shame fossil-fuel junkies 
hooked on a billion-dollar industry, Naomi Klein 
investigates capitalism and climate change. She sees 
the global crisis as a potential spur to positive action, 
as happened with the women’s rights movement. 
(See Nico Stehr’s review: Nature 513, 312; 2014.) 


Journal of the 
plague years 


Mark Dybul applauds the latest chapter in an account of 
a life at the leading edge of HIV research and policy. 


irologist Peter Piot’s AIDS Between 

\ Science and Politics is a terrific follow- 
up to his highly acclaimed memoir 

No Time to Lose (W. W. Norton, 2012). It 
demonstrates the deep intellectual lessons of 
a lifetime at the cutting edge of science and 
politics. Piot’s narrative ranges from his thrill- 
ing, on-the-ground experiences in remote 
regions of Africa as a young scientist and 
member of the team that identified Ebola, 
to the high-altitude reflections of his years as 
executive director of the Joint United Nations 
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Programme on HIV/AIDS (UNAIDS). 

Three messages underpin the book’s nine 
chapters. One is that the HIV epidemic gen- 
erated an unprecedented local and global 
response, recast many development and 
health paradigms, and ultimately triggered 
treatments that have saved millions of lives. 
The second is that progress was made only 
when various scientific disciplines, on- 
the-ground implementation strategies and 
politics were aligned. And the third? That 
AIDS is not over. 


The Boom: How Fracking Ignited the American 
Energy Revolution and Changed the World 
Russell Gold SIMON AND SCHUSTER 2015 
Journalist Russell Gold traces the rise of fracking, 
a tale of innovation and investment — such as 
ex-oilman Aubrey McClendon’s 260,000 land 
acquisitions in Texas’s Barnett Shale. (See Chris 
Nelder’s review: Nature 508, 185; 2014.) 
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HIV is one of the most 
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history, reducing life ET 
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national scale, it has oe 
Columbia Univ. Press: 

devastated groups of 5975 
the most marginal- 
ized and vulnerable people, including men 
who have sex with men, transgender people, 
people who inject drugs, sex workers, prison- 
ers and others. Piot also points out the links 
with historical injustices such as the 1948-94 
apartheid in South Africa, where the social 
fabric was so badly damaged and the mistrust 
of the authorities grew so deep that it exacer- 
bated the outbreak of the epidemic. 

Piot traces the early days of the response to 
HIV/AIDS, when governments closed their 
ears and wallets to the growing epidemic. 


People with the virus rallied to fight for atten- 
tion and resources, with increasing energy 
and sophistication. Remarkable individual 
and community action effected a breath- 
taking shift from a paternalistic, government- 
only approach to development and health, 
to one in which partnership and inclusivity 
provided a more effective response. 

From such a heady climate were born 
the Global Fund to Fight HIV, Tuberculosis 
and Malaria in 2002 and the US President's 
Emergency Plan for AIDS Relief (PEPFAR) 
in 2003, both aiming to provide funding for 
programmes in countries most affected by the 
diseases. Perhaps of even greater significance 
was the growth of multisectoral local institu- 
tions, including national AIDS councils in 
dozens of countries, often reporting to the 
head of state. And there has been an increase 
of billions of dollars in domestic financing for 
HIV/AIDS response. As a result, many mil- 
lions of lives have been saved and lifted up. 

The theme of effective partnerships that 
span disciplines, politics and theologies 
(religious, scientific and otherwise) per- 
meates the book, and is reflected in its title. 
The science is complex, as are the politics, 
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and they are intimately linked owing to 
social elements, and the discrimination and 
stigma they engender — what Piot calls the 
challenge of “sex, drugs and rock and roll”. 
Data, modelling and advocacy were essen- 
tial to move national and global politics, but 
political leadership also drove a demand for 
scientific advance, data and results. 

At a national level, Piot praises former 
president of Botswana Festus Mogae for his 
personal leadership in declaring, in 2001, 
that HIV was a threat to his country’s exist- 
ence. Mogae dedicated significant national 
resources to antiretroviral therapy at a time 
when much of the global health community 
scoffed at the viability of such a programme 
in Africa. These moves were strengthened 
by an alliance with the Bill & Melinda Gates 
Foundation and with pharmaceutical giant 
Merck, creating what Piot deems perhaps the 
most successful and impactful public-private 
partnership in history. At the global level, Piot 
singles out the push by then-UN secretary- 
general Kofi Annan for the creation of the 
Global Fund, and George W. Bush's efforts 
towards the inauguration of PEPFAR. 

Piot is honest about failures. He cites an 
effort he pioneered in the mid-2000s to 
engage the US and European pharmaceutical 
industries in expanding access to antiretro- 
viral treatment for people in low-income 
countries. The opposition came from a 
diverse group of people and institutions, 
including academics and political activists, 
who feared that participation by the pharma- 
ceutical industry would be geared towards 
increasing profits, not expanding access. 

He is also clear about the need to remain 
focused and vigilant. In his view, the epi- 
demic could be significantly reduced with 
sufficient resources, channelled to where 
they would make the most difference. This 
is the heavy baton that has been picked up by 
Piot’s successor at UNAIDS, Michel Sidibé. 

AIDS Between Science and Politics is a 
must-read for anyone interested in the HIV/ 
AIDS epidemic. More broadly, it offers les- 
sons — and interesting anecdotes — useful 
in the response to Ebola and indeed to every 
challenge in global health and development. m 


Mark Dybul is executive director of the 
Global Fund to Fight AIDS, Tuberculosis 
and Malaria in Geneva, Switzerland. 


The Knowledge: How to Rebuild Our World After 
an Apocalypse 

Lewis Dartnell VINTAGE 2015 

In a post-apocalyptic world, could we rebuild 
civilization? Lewis Dartnell condenses millennia of 
achievement into a handbook on mastering Earth’s 
resources to produce food, energy and medicines 
using our “greatest invention”: science. 


Islands Beyond the Horizon: The Life of Twenty 
of the World’s Most Remote Places 

Roger Lovegrove OXFORD UNIV. PRESS 2015 

In his profile of inaccessible islands, Roger 
Lovegrove’s admiration for wildlife shines. But 
from Russia’s ice-locked Wrangel Island, where 
polar bears banquet on walruses, to the Pacific 
Tuamotus, few such idylls remain pristine. 
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Pioneer of polio 


eradication 


Tilli Tansey extols a biography of determined 
vaccine trailblazer Jonas Salk. 


n 12 April 
1955, across 
the United 


States, “church bells 
tolled, horns honked, 
and sirens rang” in 
celebration: the larg- 
est clinical trial ever 
undertaken had 
reported that the first 


Jonas Salk: A Life 


polio vaccine was safe ee 
and effective. So writes °°” 

. . Oxford Univ. Press: 
Charlotte Jacobs in 9975, 


her riveting biogra- 

phy of the vaccine’s discoverer, Jonas Salk. 
Two years earlier, poliomyelitis had killed 
or paralysed almost 36,000 US children. It 
has been estimated that before the vaccine 
there were 600,000 cases a year worldwide. 
Salk’s triumph made hima global household 
name, and the gongs began rolling in. 

As Jacobs shows, the tale of Salk’s discovery 
is one of grind, intrigue, rivalry, politics and 
dirty tricks. Add commercial interests (phar- 
maceutical giant Eli Lilly made US$30 mil- 
lion from polio vaccine in 1955 alone) and 
Salk’s extramarital entanglements while 
wedded to artist (and muse to Pablo Picasso) 
Franc¢oise Gilot, and the mix becomes even 
headier. Jacobs contextualizes the polio effort 
with Salk’s work on influenza, multiple scle- 
rosis and HIV/AIDS — and with the Salk 
Institute for Biological Studies in La Jolla, 
California, which he created and directed, 
and from which he was ultimately excluded. 

Growing up in the Bronx, New York, in a 
family of Russian Jewish immigrants, Salk 
was dominated by his ambitious mother. He 
came to crave doing things his own way — a 


Water 4.0 


David Sedlak YALE UNIV. PRESS 2015 


tenacity that shaped 
both his scientific 
success and his pro- 
fessional difficulties. 
Early on, at the New 
York University Col- 
lege of Medicine, Salk 
worked with influenza 
expert Thomas Francis. 
At the time, antiviral vaccines 
(against smallpox, rabies and yellow 
fever) used artificially weakened live virus. 
Salk and Francis developed an experimental 
technique that used killed virus to stimulate 
antibody production and confer immunity, 
suggesting a powerful therapeutic approach. 

In 1941, Salk followed Francis to Michi- 
gan, pursuing the holy grail — an influenza 
vaccine. He modified laborious procedures 
to culture the virus and develop vaccine pro- 
duction, usually with only gloves and a mask 
as protection. He supervised clinical tests on 
patients at two psychiatric institutions, delib- 
erately infecting some with influenza — a 
practice common until the Nuremberg Code 
of 1947 offered some protection to human 
research subjects — and in 1945 advised the 
US surgeon general to vaccinate 8 million 
soldiers. Without consulting Francis, Salk 
signed an exclusive contract with pharma- 
ceutical firm Parke, Davis to provide details 
of production methods that he devised. This 
departure from academic etiquette did not 
go unnoticed; nor did his writing for non- 
professional publications such as Parents 
Magazine. Tensions grew as Francis received 
honours, while Salk was ignored. 

In 1947, Jacobs recounts, Salk left to 
establish an influenza lab at the University 


2 


of Pittsburgh in Pennsylva- 
nia. Harry Weaver, research 
director of the National 
Foundation for Infantile 
Paralysis (NFIP), invited 
him to work on typ- 
ing polio viruses and, 
eager for space, staff 
and equipment, Salk 
accepted. He made rapid 
progress, refining and 
replacing contemporary 
methodologies — often in the 
face of criticism or even prohibi- 
tion from the NFIP advisory commit- 
tee. Weaver supported his protégé, and the 
foundation came to see Salk as its ‘poster 
scientist, wheeled out for public and media 
events. Fellow researchers continued to carp. 
Undeterred, Salk pioneered a killed-virus 
vaccine and organized safety testing and 
field trials. Those leading up to the 1955 
announcement involved more than 1.5 mil- 
lion children, tens of thousands of doctors 
and nurses, and 220,000 volunteers. The 
clamour attendant on success (a film was 
mooted, to star Marlon Brando) laid Salk 
open to charges of self-aggrandisement. 
Almost immediately, problems arose. 
Batches of vaccine became contaminated, 
and physicians inoculated family and friends 
while leaving first-grade children, the most 
vulnerable group, unprotected. Over a few 
months, 260 individuals contracted polio 
directly or indirectly from a single substand- 
ard preparation. Several states suspended 
vaccination. Massachusetts halted its pro- 
gramme when children had received only the 
first of three shots; in July 1955, some 4,000 
contracted polio and 1,700 were paralysed. 
Salk was unfairly associated with these errors 
and was castigated, especially by scientific 


From Roman aqueducts to desalinization plants, TALF 
David Sedlak’s study overflows with facts about Y -~ 
water management. Chlorine by-products could 
be carcinogens, so he argues that water treatment © 
needs another upgrade. (See Margaret Catley- 

Carlson’s review: Nature 505, 288-289; 2014.) 
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colleagues. However, tighter adherence to his 


The Tale of the Duelling Neurosurgeons 

Sam Kean BLACK SWAN 2015 

Crammed with curious anecdotes from 
neuroscience’s gory past, Sam Kean’s book 
ranges from the crude methods of early brain 
studies (including the beheading of criminals to 
use as test subjects) to the prion disease kuru, 
which spreads through cannibalism. 


SPRING BOOKS | COMMENT | 


protocols and resumption of vaccination 
meant that six years after the vaccine was 
introduced, polio was almost eradicated 
in the United States. 

Much of the scientific establishment 
closed ranks against Salk. He was given the 
prestigious Lasker Award for clinical medi- 
cal research in 1956, but Swedish virologist 
Sven Gard dealt his Nobel nomination a 
fatal blow by sneering that the vaccine was 
a technical advance, not a discovery. Nor 
was Salk elected to the US National Acad- 
emy of Sciences. Virologist Albert Sabin — 
bombastic, imperious and galled by Salk’s 
success — continued to develop a live, 
orally delivered poliovirus preparation. By 
1961, Sabin’s vaccine had performed well 
in trials and the American Medical Asso- 
ciation began to promote it. Salk’s vaccine 
was, for a time, superseded, and his efforts 
to improve its potency stymied. 

Salk moved on, although he remained 
involved with the polio vaccine. Influenced 
by chemist C. P. Snow’s 1959 book The Two 
Cultures and the Scientific Revolution, he 
launched a research institute integrating 
social responsibility and the humanities 
with the biological sciences. The Salk Insti- 
tute recruited some of the great biologists 
of the time, including Jacob Bronowski, 
Francis Crick and Jacques Monod. But 
Salk was unable to translate his lofty ideals 
into practical management. His research 
from the 1960s onwards, on immune 
responses in cancer, multiple sclerosis and, 
later, HIV/AIDS, met with ambivalence. 
He was increasingly derided by the very 
scientists whom he had recruited. 

In many ways, Salk was ahead of his 
time, notably in public engagement and 
in his multidisciplinary agenda. A polio 
vaccine would have emerged without him, 
but it was his vision and willpower that 
produced the first, and a descendant of it 
is still the basis of many public-health pro- 
grammes. Yet universal polio eradication 
remains a dream: cases continue to appear 


One hundred years of 
general relativity 


Pedro Ferreira looks back at how Einstein himself anda 
panoply of other physicists have framed the theory. 


in Pakistan, Afghanistan and Nigeria, and 
have resurged in recent years in Syria. m 


Tilli Tansey is professor of the history of 
modern medical sciences at Queen Mary, 
University of London. 

e-mail: t.tansey@qmul.ac.uk 


ntil very recently, relativists were 

| | few and often self-taught. General 
relativity still had the stigma of 

being esoteric, pointless and, well, hard. 
In some places you could find special- 


ized graduate courses, but on the whole, 
if you were at all interested in expanding 


universes and black holes, you were left to 
your own devices. That is what happened 
to me. 

I studied engineering and did not enjoy 
it very much. But during the course on 
electromagnetism, I discovered Albert 
Einstein’s world of special relativity. > 


] Tesla: Inventor of the Electrical Age | = ] 

| W. Bernard Carlson PRINCETON UNIV. PRESS 2015 

Over-hyped eccentric or electricity wizard? Bernard 
Carlson’s account of Nikola Tesla’s life at the turn of the 
twentieth century recalls the inventor's great creations, 
such as the alternating-current motor, as well as the 
unfulfilled promise of wireless power. (See Patrick 
McCray’s review: Nature 497, 562-563; 2013.) 


Infinitesimal: How a Dangerous Mathematical 
Theory Shaped the Modern World 

Amir Alexander ONEWORLD 2015 

Through religious and revolutionary figures of 
the seventeenth century, Amir Alexander tells the 
history of the struggle for mathematics’ place in 
society. The ‘heretical’ concept of infinitesimals, 
the indivisible points of a line, takes centre stage. 
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>» The mathematics was seductive, the 
paradoxes were mind-blowing, and it set me 
up to try to learn his general theory of rela- 
tivity. That theory explains how the gravita- 
tional force is nothing more than space-time 
bending and warping as it responds to the 
presence of energy and mass. To under- 
stand this revolutionary viewpoint, I had 
to look for the right book, something that 
could lead me through all the intricacies of 
Riemannian geometry, which overturns the 
rules of Euclidean geometry that we learn in 
school. Yet I also needed to understand the 
physics: the bending of light and the orbit 
of Mercury. 

One book stood out: Einstein’s popular 
Relativity: The Special and the General 
Theory. Published in German in 1916, 
following Einstein’s groundbreaking 
1915 paper on the general theory of 
relativity, it was translated into English 
in 1920. In 2015, we see the publica- 
tion of a special anniversary edition, 
as well as an annotated version of the 
manuscript of the paper in Hanoch 
Gutfreund and Jiirgen Renn’s The Road 
to Relativity. 

Einstein’s book Relativity was 
supposed to be understandable by all, 
yet to have enough maths to allow the 
more educated reader to get into the 
guts of his ideas. It has very few equa- 
tions, rendering it less explanatory and 
more illustrative. But there are definitely a 
lot of words. Einstein set himself the task of 
explaining the concepts and ideas behind his 
theory, using situations from everyday life, 
such as trains moving on platforms or clocks 
on walls. His prose is tempered with some 
philosophical considerations, for example a 
discussion of the ‘a priori’ assumption that 
empty space exists. 

Dare I say it, I found the prose inelegant. 
This caught me by surprise. I had read some 
of Einstein’s 1905 papers, including the 
one introducing special relativity, and had 
thought them gems. Relativity, by contrast, 
was not particularly clear and a bit dull. 
Einstein had declared, in the introduction, 
that he would repeat himself frequently, 
“without paying the slightest attention to 
the elegance of the presentation” In this he 
might have been following the dictum of 
physicist Ludwig Boltzmann, who pinned 


Relativity: The Special and the General 
Theory (100th Anniversary edition) 
ALBERT EINSTEIN 

Princeton Univ. Press: 2015 


The Road to Relativity: The History and 
Meaning of Einstein’s “The Foundation of 
General Relativity” Featuring the Original 
Manuscript of Einstein’s Masterpiece 
HANOCH GUTFREUND AND JURGEN RENN 
Princeton Univ. Press: 2015 


down the concept of entropy and declared 
that “matters of elegance should be left to 
the tailor and to the cobbler”. Nevertheless, 
there is something honest about Einstein’s 
attempt at popular writing: he does not gloss 
over difficulties. His theory was, to some 


BIZARRE PREDICTIONS FROM 


GENERAL RELATIVITY, 


PREDICTIONS THAT 


EINSTEIN WAS WARY 


OF AT FIRST, HAVE 


STOLEN HIS THUNDER. 


extent, all there in his book. The treatment 
just did not seem to work, and he knew it. 
He told a friend, the Swiss—Italian engineer 
Michele Besso, that it was “quite wooden” In 
later years, he joked with the Polish physi- 
cist Leopold Infeld that the description 
“generally understandable” on the book’s 
cover should be changed to “generally not 
understandable”. 

Having given up on Einstein, I looked 
around and found much to choose from. 
As soon as Einstein had put his theory out, 
others took over and made it their own. 
Arthur Eddington, the UK astronomer 
who had measured the bending of light in a 
1919 eclipse expedition, wrote a beautifully 
crafted mathematical treatise on the theory 
of space-time in 1923. Erwin Schrodinger, 
one of the fathers of quantum physics, 
came up with his own, more-conceptual 
rendition, Space-Time Structure, published 
in 1950. Fellow quantum pioneer Paul 
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A Rough Ride to the Future 

James Lovelock PENGUIN 2015 

Independent scientist James Lovelock gazes at 
Earth’s past, present and future as the self- 
regulating system Gaia. Focusing on climate, he 
foresees humanity in 100 million years merged 


with artificial intelligence to survive a hotter Earth. 
(See Tim Lenton’s review: Nature 508, 41; 2014.) 
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Dirac’s reticent personality comes across 
in his lecture notes on the theory, number- 
ing just under 70 pages and published as 
The General Theory of Relativity in 1975. 
Euphoria and creativity pour out of the 
1,200-page behemoth Gravitation (1973) 
by John Archibald Wheeler and his disci- 
ples Charles Misner and Kip Thorne. I was 
spoiled for choice. 
Although I never used Einstein’s book, 

it kept cropping up in my life. I have a 
penchant for second-hand bookshops 
and would keep finding translations, each 
with its own story. The French version 
was first translated by Jeanne Rouvieére, 
a protégée of mathematician and politi- 
cian Emile Borel, and subsequently 
expanded by Einstein’s friend Mau- 


rice Solovine. The mathematician 
Tullio Levi-Civita, whose work had 


been instrumental in sucking Einstein 
into Riemannian geometry, recom- 
mended an engineer, Giuseppe Luigi 


Calisse, to do the Italian translation. 


The Russian version was translated by 
a Jewish logician-philosopher, Grego- 


rius Itelson, who lived in Berlin and 


was beaten to death in 1926 by an 
anti-Semitic crowd. 

Today, Einstein’s book is a historical 
curio. I don't think anyone still reads 


it as he intended. There have been so 


many attempts at popularizing the theory, 
from practitioners and journalists, that 
anyone can find a book to their taste. And 
we have learnt much in the century since 
it was published: a popular book on rela- 
tivity must now talk about the expanding 
Universe and the Big Bang, black holes and 
singularities. These bizarre predictions 
from general relativity, predictions that 
Einstein was wary of at first, have stolen 
his thunder. 

Yet I can still see some fugitive magic 
in Relativity, despite its “wooden” tone. It 
conjures Einstein as the oracle presenting 
a theory to the world — one of the most 
revolutionary and profound theories of all 
time. m 


Pedro Ferreira is professor of astrophysics 
at the University of Oxford, UK, and author 
of The Perfect Theory. 

e-mail: p.ferreiral@physics.ox.ac.uk 


Junkyard Planet: Travels in the Billion-Dollar 
Trash Trade 

Adam Minter BLOOMSBURY 2015 

One man’s trash is another’s treasure in Adam 
Minter’s exploration of the US$500-billion global 
recycling trade. US waste tops the charts, and 
China’s electronics-manufacturing industry sifts 
gold from mountains of e-waste. Emily Banham 
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Germline edits: heat 
does not help debate 


Edward Lanphier and colleagues 
marshal a familiar and, we believe, 
ultimately counterproductive line 
of argument against genetically 
modifying the human germline 
(Nature 519, 410-411; 2015, 

see also Nature http://doi.org/3xt; 
2015). 

In his essay The Rhetoric of 
Reaction (Belknap, 1991), the late 
economist Albert Hirschman 
describes three simplifying, 
stereotypical theses that are often 
invoked to preserve prevailing 
policies or practices. Lanphier 
et al. follow these same well- 
travelled tracks. They warn that 
a public outcry over the ethics of 
germline editing could hinder 
therapeutic applications of non- 
heritable genetic modification 
of somatic cells (the perversity 
thesis); that the technique might 
not deliver the anticipated 
benefits (the futility thesis); 
and that modifying genes in 
reproductive cells could bea 
slippery slope towards non- 
therapeutic genetic enhancement 
(the jeopardy thesis). 

Such blinkered arguments can 
lead to sharply polarized views, 
obstructing the balanced and 
pragmatic societal discussion 
and careful studies that should 
precede possible applications of 
human germline modification 
(see D. Baltimore et al. Science 
348, 36-38; 2015). 

Mika Martikainen, Ole Pedersen 
Newcastle University, UK. 
mika.martikainen@newcastle.ac.uk 


Germline edits: trust 
ethics review process 


Edward Lanphier and colleagues 
contend that human germline 
editing is an unethical 
technology because it could have 
unpredictable effects on future 
generations. In our view, such 
misgivings do not justify their 
proposed moratorium (Nature 
519, 410-411; 2015). 

When in vitro fertilization and 
preimplantation genetic diagnosis 


were first introduced, they had 
unpredictable consequences. 
Both went on to transform 
reproductive medicine. 

Many nations already have 
ethics review processes that 
assess the risks of experiments 
on human embryos, with 
the prospect of even stricter 
evaluation standards as new 
fertility techniques come along 
(I. G. Cohen et al. Science 348, 
178-180; 2015). 

There is no reason to close off 
whole avenues of controversial 
research when they have 
barely begun (see, for example, 
Nature http://doi.org/3xt; 
2015). Germline editing is a 
revolutionary technology that 
potentially offers an enormous 
range of benefits to the next 
generation. 

Julian Savulescu, Chris 
Gyngell, Tom Douglas 
University of Oxford, UK. 
julian.savulescu@philosophy. 
ox.ac.uk 


Lift sanctions now to 
save public health 


The international accord on 
Iran’s nuclear programme agreed 
this month in Switzerland 

offers no timeline for lifting 
international economic 
sanctions, which profoundly 
affect public health (see, for 
example, S. Shahabi et al. Nature 
520, 157; 2015). This must be 
urgently rectified because, as 
the Iranian health minister has 
stressed, it will take a year to 
restore public-health systems 
after sanctions are lifted. 

By driving up prices and 
limiting the availability of 
medicines, sanctions are 
forcing people and clinics to 
use poor-quality, black-market 
medications. A shocking example 
still under investigation is last 
month's unexpected permanent 
loss of vision in 15 people after 
eye surgery in a Tehran clinic, 
with unknown numbers similarly 
affected at two other centres. The 
tragedy is thought to have arisen 
from a non-standard ampoule 


(see go.nature.com/oyudtj; in 
Persian). 

Health care is a fundamental 
human right (see go.nature. 
com/xqtarv and go.nature. 
com/xuoeyb). Specialists in 
human rights, such as the United 
Nations special rapporteur on 
human rights in Iran, Ahmed 
Shaheed, should remind the 
US Congress and other Western 
governments of the importance 
of lifting sanctions and of a clear 
strategy to rapidly improve the 
country’s public health. 

Mehdi Aloosh Montreal, Canada. 
Arash Aloosh BI Norwegian 
Business School, Oslo, Norway. 
md_aloosh@hotmail.com 


Double-blind peer 
review a double risk 


Double-blind peer review of 
research papers is a worthy idea 
but has two flaws in practice 
(Nature 518, 274; 2015). 

First, most modern research 
builds on previous studies 
published by a limited number 
of groups. This makes it almost 
impossible to write a paper 
without revealing with near 
certainty who the authors are, 
even if they manage to avoid 
such giveaways as “we showed 
previously that ... (No Longer 
Anonymous et al., 2012)”. 

The second flaw is more 
serious. To function in our 
increasingly competitive research 
culture, in which misconduct 
is on the rise, researchers need 
to be aware of which labs can 
be trusted and which have a 
record of irreproducibility. If 
ahighly regarded lab and one 
with a questionable reputation 
each submit reports of similar 
investigations, a good reviewer 
would be extra vigilant in 
assessing the less-reliable lab’s 
study, even though the same 
evaluation standards would be 
upheld for both. 

Double-blind peer review 
removes this crucial quality- 
control option, opening the 
way for mediocre and bad labs 
to clutter the literature with 


sub-standard science. 
Thomas E. DeCoursey Rush 
University Medical Center, 
Chicago, Illinois, USA. 
tdecours@rush.edu 


Better policing for 
fishy catch data 


Missing catch data are a 

big problem in achieving 
sustainability of world fisheries 
(Nature 519, 280-282; 2015). 
The accuracy of catch data is also 
important — particularly for 
endangered fish species. 

Take the common skate 
(Dipturus batis), now classified 
as critically endangered in the 
Red List of the International 
Union for Conservation of 
Nature. In 2009, the European 
Union puta total ban on fishing 
for this species and made it 
illegal to retain these fish on 
vessels, exchange them between 
boats, or land them. 

We were therefore surprised to 
learn that official landing records 
of D. batis in 2014 by UK vessels 
in the United Kingdom and 
Europe, and by foreign vessels in 
UK ports, showed commercial 
catches totalling 1.8 tonnes. In 
2011-13, the total recorded catch 
was even higher at 7.8 tonnes 
(all data by written request from 
the UK Marine Management 
Organisation). 

The reliability of these 
figures is unclear, however, 
given that catch totals are 
compiled from assorted data 
collected at numerous UK ports. 
Local inaccuracies might be 
perpetrated through systematic 
errors in species identification, 
for example, or by mistakes in 
the codes allocated at market. 

It is essential for the future 
of vulnerable species that catch 
data are properly checked by the 
authorities to monitor adherence 
to bans, and that punitive action 
is taken against contraventions. 
David W. Sims, Samantha J. 
Simpson Marine Biological 
Association of the United 
Kingdom, Plymouth, UK. 
dws@mba.ac.uk 
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For News & Views online, go to 
nature.com/newsandviews 


An essential passenger with p53 


Deletion of the TP53 gene, an event seen in colorectal cancers, often occurs with co-deletion of a gene that encodes an enzyme 
subunit governing gene transcription. This creates a vulnerability ripe for therapeutic development. SEE LETTER P.697 


JAMES E. BRADNER 


here are increasing numbers of cancer 
sk that successfully target cell- 
growth proteins that are activated by 
mutation. But when proteins that restrict cell 
growth, called tumour suppressors, are inacti- 
vated by mutation or deletion, few therapeutic 
strategies exist. Often, tumour-suppressor loss 
creates a pathway-specific cellular dependency, 
such as the reliance of BRCA-mutant breast- 
cancer cells on the enzyme poly(ADP-ribose) 
polymerase’”. Regrettably, we have no general 
strategy for targeting tumour-suppressor loss, 
and nowhere is a creative solution to this chal- 
lenge more urgently needed than in the con- 
text of the tumour suppressor p53. Alteration 
of the p53-encoding gene TP53 is the most 
common genetic event observed among 12 
of the most common cancer types (42% of 
tumours)’. Yet despite definitive biological 
characterization over decades of study, there is 
no drug that acts directly on the p53 pathway. 
On page 697 of this issue, Liu et al.’ report that 
deletion of TP53 in many colorectal cancers is 
often accompanied by deletion of one copy of 
a subunit of the enzyme RNA polymerase II. 
This co-deletion renders such hemizygous 
cells sensitive to polymerase inhibition by the 
natural toxin a-amanitin. 
p53 is a transcription factor that activates 
the expression of cell-cycle checkpoint genes 
such as CDKN1A in response to DNA damage, 
thereby arresting cell proliferation. In cancer, 
alteration of TP53, either by mutation or dele- 
tion, promotes cell proliferation and survival. 
Mutant p53 lacks a capacity for transcriptional 
activation, and so direct-acting small mol- 
ecules would need to have a compensatory, 
corrective function. Such molecules have 
proved elusive. Hemizygous deletion of TP53 
results in reduced abundance of p53, establish- 
ing a rationale for stabilizing remaining p53 by 
targeting the protein MDM2 — a ubiquitin- 
ligase enzyme that promotes p53 degradation’. 
Derivatives of the small molecule nutlin-3a, 
which disrupts the MDM2-p53 interac- 
tion, have recently transitioned to human 
clinical investigation. But there remains a 
pressing need for therapeutics that target 
p53 loss. 
Deletions in cancer genomes are not limited 
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Figure 1 | Genetic vulnerability to transcriptional inhibition in colorectal cancer. Liu et al.* show that 
the large deletion on chromosome 17 that underlies many human colorectal cancers leads to hemizygous 
(one copy) loss not only of the TP53 tumour-suppressor gene, but also of the gene POLR2A, which encodes a 
subunit of the essential enzyme RNA polymerase II. This event results in decreased abundance of the enzyme 
in the cancer cells, such that the cells are killed by much lower concentrations of the natural toxin a-amanitin 
than kill non-cancerous cells that express both copies of POLR2A (idealized survival curves are shown). 


to tumour-suppressor genes, and often involve 
bystander or ‘passenger’ genes. There is an 
increasing appreciation that collateral genetic 
damage to passenger genes may create unique 
vulnerabilities in cancer cells. For example, 
a study’ of the aggressive brain tumour glio- 
blastoma multiforme found that the common 
causative mutation — deletion of both copies 
of the gene that encodes the metabolic enzyme 
enolase 1 (ENO1) — creates a heavy reliance 
on the remaining copies of the gene encoding 
the back-up enzyme enolase 2 (ENO2). Drugs 
that target ENO2 selectively killed ENO1- 
deleted cancer cells. 

Another study’ built on the observation 
that most human tumour cells harbour partial 
loss of at least 10% of their genome, typically 
as a result of the loss of whole chromosomes 
or chromosome arms. Among partially lost 
genes, the authors of that study found that 
hemizygous deletion of the gene PSMC2, 
which encodes a regulatory subunit of the 
essential proteasome protein-degradation 
complex, rendered ovarian-cancer cell lines 
highly sensitive to reduced PSMC2 expression 
(generated by the technique of RNA interfer- 
ence), both in vitro and in vivo. 

Liu et al. extend this logic to study hemi- 
zygous loss of TP53 in colorectal cancer. The 
authors find that regional deletions on chro- 
mosome 17 commonly involve both TP53 
and a neighbouring gene, POLR2A. This 
gene encodes the largest subunit (subunit A) 
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of RNA polymerase I, which transcribes 
messenger RNA from protein-coding genes, 
an essential function of all human cells. 
The authors report that hemizygous loss of 
POLR2A results in decreased abundance of 
RNA polymerase II in colorectal-cancer cells, 
leading them to suggest that these altered 
cancer cells could be more vulnerable to 
transcriptional inhibition than unaltered 
normal cells. 

Using chemical and genetic approaches in 
defined colorectal-cancer cell lines, Liu et al. 
observed a crucial dependence on the remain- 
ing copy of POLR2A (Fig. 1). They found that 
hemizygosity for POLR2A renders colorec- 
tal cancer cells ten times more sensitive to 
a-amanitin, and highly sensitive to silencing 
of POLR2A-gene expression by RNA interfer- 
ence. Careful controls validated these find- 
ings, including desensitization to a-amanitin 
by restoration of normal POLR2A expression, 
and comparisons of engineered paired cell 
lines with or without hemizygous alteration 
of POLR2A. 

To assess the potential clinical relevance of 
these findings, the authors conducted RNA- 
interference and a-amanitin-efficacy studies 
in immunocompromised mice bearing human 
tumours. To overcome the inadequacies of 
d-amanitin as a therapeutic agent (such as its 
systemic toxicity), an antibody—drug conjugate 
was used to deliver a-amanitin to tumours 
through binding to the cell-surface protein 


EpCAM. In all studies, sensitivity of POLR2A- 
hemizygous cells to transcriptional inhibition 
was confirmed. 

This discovery opens a therapeutic window 
for transcriptional inhibition in a common, 
genetically defined subtype of human can- 
cer. Limitations of the approach include the 
likelihood of drug resistance arising through 
duplication (or further amplification) of the 
remaining intact POLR2A gene, and the reli- 
ance on the natural poison a-amanitin. Nev- 
ertheless, these studies establish powerful 
reagents and a strong rationale for studying 
inhibitors of transcription and transcriptional 


signalling in cancer. Immediate next steps for 
this research are likely to include assessment 
of inhibitors of transcriptional kinase enzymes 
(for example, CDK7 inhibition by THZ-1) and 
modulators of chromatin-associated tran- 
scription factors (such as BET bromodomain 
inhibition by JQ1). Altogether, this study pro- 
vides further validation that investigation of 
genomic loss can reveal previously unrecog- 
nized and actionable dependencies, establish- 
ing hope of a therapeutic inroad to managing 
TP53 alteration in cancer. m 


James E. Bradner is in the Department 


Stillbirth prevented 
by signal blockade 


Bacterial infections during pregnancy can cause fetal death. The identification 
of a cell-signalling pathway induced during infection that recruits dangerous 
immune cells to the placenta may open up avenues to prevent this. 


ANNA BAKARDJIEV 


he birth of a dead baby during the 
[isene half of gestation is called still- 

birth. It isan emotionally devastating 
experience for affected families and a major 
global health problem, especially in the devel- 
oping world. Although there are many causes 
of stillbirth, maternal infections often play a 
key part, yet antimicrobial therapies are not 
effective. Writing in The Journal of Clinical 


Listeria 
infection 


Pregnant mouse 


Figure 1 | Chemokine signalling triggers fetal death. During mammalian 
pregnancy, there is contact between the outermost fetal-derived cells of the 
placenta, the trophoblasts, and maternal cells in the uterine lining, which is 
called the decidua. Maternal immune cells that have the ability to recognize 
trophoblasts as being ‘foreign’ and to trigger an immune attack to kill the fetus 
are therefore typically kept out of the decidua. However, Chaturvedi et al.' 


Investigation, Chaturvedi et al.’ report a cell- 
signalling pathway that leads to intrauterine 
accumulation of maternal immune cells dur- 
ing infection, which in turn triggers fetal death. 
The description of this mechanism may lead 
to the development of therapies to prevent 
stillbirth and other infection-related compli- 
cations of pregnancy. 

Infection during pregnancy differs from 
infection in non-pregnant individuals 
because mammalian gestation is a unique 
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immunological situation. The fetus is geneti- 
cally different from the mother, which brings 
the danger that the maternal immune system 
will mount a damaging immune response 
against this ‘foreign’ entity. Several cellu- 
lar mechanisms prevent this’; for example, 
maternal immune cells that have the capac- 
ity to kill fetal cells, such as effector T cells 
(CD8* T cells), are kept away from the womb. 
Chaturvedi and colleagues show how one of 
these protective mechanisms is subverted 
when the mother is infected with Listeria 
monocytogenes. This bacterial pathogen causes 
a rare but often fatal food-borne disease (lis- 
teriosis) in humans and other mammals, and 
infection during pregnancy can lead to fetal 
death’. It has also been used for decades as a 
model system for studying intracellular patho- 
genesis and host defence’. 

Direct contact between fetal and maternal 
cells occurs almost exclusively in the placenta 
and the adjacent uterine implantation site, the 
decidua. The decidua is a fascinating tissue, in 
which fetal cells called trophoblasts are juxta- 
posed with maternal cells. It is of the utmost 


show that intravenous infection of pregnant mice with the bacterium Listeria 
monocytogenes can lead to an influx of immune cells called neutrophils into 
the decidua; these express the chemokine molecule CXCL9, which attracts 
other immune cells, CD8* T cells, that express the CKCL9 receptor, CXCR3. 
In the authors’ experimental model, in which these T cells all recognize fetal 
structures (antigens), this influx of cells leads to fetal death. 
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importance that maternal effector T cells that 
recognize fetal structures (antigens) be kept 
out of the decidua: if trophoblasts are killed 
by these cells, the life of the fetus is in danger. 
To help prevent this, the expression of genes 
encoding signalling molecules that attract 
T cells, such as CXCLY, are silenced in decidual 
tissue’. However, Chaturvedi and colleagues 
demonstrate that infection of pregnant mice 
with L. monocytogenes induces CXCL9 expres- 
sion in the decidua, an influx of CD8* T cells, 
and fetal death (Fig. 1). 

The authors used different strains of mice 
to pinpoint the connection between fetal- 
specific CD8* T cells and fetal demise. 
When they crossed female mice in which 
all CD8* T cells recognize a specific antigen 
(ovalbumin) with male mice in which all cells 
express that antigen (to provide a surrogate 
for fetal antigens and maternal CD8* T cells 
that recognize those antigens), they found 
that ovalbumin-specific T cells accumulated 
in the decidua after intravenous infection of 
pregnant animals with L. monocytogenes. 
These T cells express CXCR3, a receptor for 
CXCLY, and are recruited by CXCL9-secreting 
neutrophils (another class of immune cell) that 
move to the decidua after infection. Deplet- 
ing neutrophils or blocking CXCR3 before or 
shortly after bacterial infection correlated with 
fewer ovalbumin-specific CD8" T cells in the 
decidua and no fetal death. 

One of the caveats of this approach is that 
massive overexpression of a single antigen at 
the maternal-fetal interface does not reflect 
normal physiological conditions, nor does 
the presence of a large population of maternal 
T cells that all recognize that antigen with high 
affinity. This combination may lead to a more 
robust effect than occurs in normal pregnan- 
cies in which there are multiple antigens anda 
broad repertoire of T-cell specificities. Alterna- 
tive techniques, such as using antigens derived 
from genetically different mice’, might provide 
a more rigorous test of these ideas. 

Is systemic listeriosis sufficient to trigger 
this signalling cascade, or is decidual infection 
required? Chaturvedi et al. used an L. mono- 
cytogenes mutant strain that is cleared more 
rapidly from maternal organs than wild-type 
bacteria’, but that can grow in the murine 
placenta’, suggesting that direct intrauterine 
invasion is necessary. 

Further studies are needed to gather evidence 
for the role of this pathway in human pregnancy, 
because the anatomy of the maternal-fetal 
interface differs between mice and humans’. 
Nevertheless, Chaturvedi and colleagues’ 
study is exciting, because it may have identi- 
fied a mechanism that underlies the association 
of stillbirth with infection. Moreover, CXKCR3 
inhibitors are currently under investigation as 
therapeutic options for a broad range of dis- 
eases, including cancer, autoimmune condi- 
tions and infectious diseases'’. Although more 
clinical trials are needed to optimize drug 
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safety and efficacy, especially where treatment 
of pregnant women would be considered, 
the authors’ findings point to a new potential 
strategy for preventing stillbirth and other 
infection-related complications of pregnancy. m 
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Malaria runs rings 
round artemisinin 


In parts of southeast Asia, malaria parasites are showing resistance to the active 
ingredient in artemisinin -based antimalarial drugs. Delineation of a cell-signalling 
pathway might help to explain this phenomenon. SEE LETTER P.683 


JEREMY BURROWS 


are two separate parts, which are clearly 

from the same picture, but the pieces link- 
ing the two fragments elude you. This is the 
situation facing the malaria research com- 
munity. The first part of the puzzle is the fact 
that, in regions of southeast Asia, Plasmodium 
falciparum malaria parasites in the ring stage 
(an early blood-borne stage of the parasite’s 
life cycle) are showing varying degrees of 
resistance to dihydroartemisinin’”, the active 
component of the artemisinin class of anti- 
malarial drugs’. The second is the culprit in 


I magine an unfinished jigsaw puzzle. There 


eoe= 


this resistance, mutations in a protein called 
P. falciparum Kelch13 (PfKelch13), about 
which little is known’, so that a mechanistic 
understanding of how resistance is conferred 
is missing. In this issue, Mbengue et al.° 
(page 683) provide some of the missing pieces 
to connect these two parts of the puzzle. 
Drug resistance can arise in various ways: 
a biological target can change or mutate so 
as to reduce a drug’s ability to bind to it; the 
drug can become physically separated from 
the target, such that it can no longer exert its 
effect; or the biological target can increase to 
high enough levels to overcome the presence 
of the drug. Mbengue and colleagues show 


Pp 


> Degradation 


Cell signalling 
and survival 


Figure 1 | Mechanism of resistance. Mbengue et al.” report that, in the malaria parasite Plasmodium 
falciparum, levels of the PfPI3K enzyme are regulated by PfKelch13, a protein that binds to PfPI3K and 
mediates the addition of ubiquitin groups (Ub), tagging PfPI3K for degradation. PfPI3K is required to 
phosphorylate (P) the phospholipid phosphatidylinositol (PI) to form phosphatidylinositol 3-phosphate, 
promoting cell signalling and survival. The authors show that dihydroartemisinin (DHA), the active 
ingredient of artemisinin-based antimalarial drugs, inhibits PfPI3K activity in ring-stage parasites. 
Mutations in PfKelch13 that prevent it from binding to PfPI3K can increase PfPI3K levels and so help 


parasites to overcome the effects of DHA. 
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that dihydroartemisinin (DHA) is a potent 
inhibitor of a P. falciparum enzyme called 
phosphatidylinositol 3-kinase (PfPI3K), 
which phosphorylates an important phos- 
pholipid, phosphatidylinositol (PI), to pro- 
duce phosphatidylinositol 3-phosphate (PI3P) 
in ring-stage parasites. However, mutations 
in PfP13K do not correlate with resistance to 
DHA, indicating that this is not a main cause 
of resistance. The authors provide no evidence 
that the malaria parasite separates itself from 
DHA by destroying it or expelling it from its 
site of action, nor that the parasite hides in a 
protective state until DHA has dissipated’. 
This leaves the hypothesis that an increase in 
PfPI3K levels might confer resistance — but 
how is this linked to PfKelch13? 

Clues about the function of PfKelch13 come 
from the roles of similar proteins in mamma- 
lian cells. The human equivalent of PfKelch13 
binds to protein targets to promote molecu- 
lar ‘tagging’ in which ubiquitin molecules 
are added to the substrate (a process called 
polyubiquitination). These ubiquitin-tagged 
protein complexes can trigger other biochemi- 
cal processes, but they are also recognized and 
broken down by a cellular structure called 
the proteasome. Mbengue and co-workers 
reasoned that PfKelch13 might function in a 
similar manner in malarial ring-stage para- 
sites, with PfPI3K as its target (Fig. 1). They 
hypothesized that mutations in PfKelch13 
inhibit its association with PfPI3K. With less 
PfKelch13-PfPI3K binding, there would be 
less P{PI3K polyubiquitination, resulting in 
less degradation and higher cellular levels of 
PfPI3K. Indeed, the authors confirmed that 
increased levels of PfPI3K and PI3P were asso- 
ciated with certain PfKelch13 mutations. They 
had connected the two jigsaw pieces. 

This work contains some surprises. First, 
it seems that this mechanism of action is spe- 
cific to ring-stage parasites, implying that 
other reported biological consequences of 
DHA affect the parasite at other stages of its life 
cycle. By contrast, it is commonly assumed that 
a drug that kills parasites in different stages of 
its life cycle is driven by a single biological 
target, rather than different targets affecting 
different stages. 

Second, the authors demonstrated that DHA 
inhibits PfPI3K more potently than it does 
related human kinases, and that this inhibi- 
tion is reversible. DHA has a weak oxygen- 
oxygen (peroxide) bond, and the researchers 
provide data to suggest that this bond, along 
with other molecular features of DHA, is nec- 
essary for reversible PfPI3K inhibition. This is 
interesting, because all other mechanisms by 
which DHA kills parasites involve breaking the 
peroxide bond*”. 

The third surprise is that DHA is different 
in structure from most types of molecule 
that inhibit kinases, although Mbengue et al. 
use modelling to provide some possible 
explanations for this discrepancy. It will be 
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fascinating to carry out in-depth studies of 
interactions between DHA and PfPI3K. 

Finally, PfKelch13 mutations correlate with 
only a slight increase in levels of PfPI3K in 
resistant parasites, but the extent of ring-stage 
resistance to DHA seems to be ofa high mag- 
nitude, indicating that small alterations in 
Pf£PI3K levels can have a large effect on resist- 
ance. This suggests that resistance could be 
overcome by increasing the dose of artemisinin 
in drug combinations, to inhibit PfPI3K more 
potently — and such a study” has been inde- 
pendently conducted. 

Many questions remain unanswered, the 
most important of which is why parasites 
survive when levels of PfPI3K and PI3P 
increase. Furthermore, exactly how DHA 
inhibits PfPI3K remains to be seen. How do 
PfPI3K and PfKelch13 bind together, and why 
do mutations hinder this binding? Finally, 
why does polyubiquitination increase 
following PfPI3K-PfKelch13 binding, and 
does tagging lead to intracellular signalling 
events or solely to destruction of the complex 
by the proteasome? 

Will this knowledge of the ring-stage target 
of DHA help us to design better drugs, as the 
authors claim? Yes and no. Because resistance 
is due to factors other than PfPI3K mutation, 
new PfPI3K inhibitors will not only need 
to be very potent, but also, crucially, must 
show potency against the key ring-stage- 
resistant parasites. We know that mutations in 
PfKelch13 can lead to less PfKelch13-PfPI3K 
binding, and that this leads to resistance. Only 
time and a much greater understanding of the 
underlying biology will tell us whether the 
parasite might overcome PfPI3K inhibitors 
through mutations that further reduce binding 
between PfKelch13 and PfPI3K. From a drug- 
development perspective, given that there are 
other antimalarial drugs'"” in clinical devel- 
opment that have a weak peroxide bond but 
that differ from DHA, it will be intriguing to 
see whether they show inhibition of PfPI3K 
and resistance against the ring stage similar 
to those demonstrated by DHA”. 

It seems unlikely that this study will mark 
the completion of the malarial-resistance 
puzzle. A review of the field” has declared that 
“the debate continues”. I am sure that it will. m 
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Northern push for 
the bipolar see-saw 


Antarctic ice-core data show that abrupt changes of climate in the Northern 
Hemisphere in the last glacial period preceded associated shifts in Antarctica by 
about 200 years — indicating an oceanic coupling process. SEE LETTER P.661 


TAS VAN OMMEN 


ver the past 30 years, ice cores from 

Greenland, and subsequently from 

Antarctica, have progressively revealed 
a fascinating and unexpected picture of inter- 
hemispheric climate behaviour during the 
last glacial period, approximately 110,000 to 
12,000 years ago. In the north, the glacial cold 
was punctuated by a series of abrupt warming 
events, each followed by cooling over several 
centuries before jumping back to cold, glacial 
conditions. These Dansgaard—Oeschger (DO) 
events, as they became known, have counter- 
parts in Antarctic records, although with a 
different character: the Antarctic events show 
steady warming trends during Greenland’s cold 
phases that peak and reverse to cooling trends 
when Greenland warms (Fig. 1). 

As higher-resolution Antarctic ice-core 
records have been recovered, and tighter hemi- 
spheric synchronization of records has been 
achieved, it has become possible 
to unambiguously establish a one- 
to-one correspondence between 
DO events and the associated Ant- 
arctic temperature peaks. These 
peaks, observed in proxy form as 
variations of isotopic abundances of 
water in ice cores, are referred to as 


> 


Temperature proxy 


directionality and the relative timing of events, 
revealing a fairly slow connection that is con- 
sistent with hemispheric coupling by oceanic 
rather than atmospheric processes. 

The underlying hypothesis for the observed 
hemispheric climate coupling, known as the 
bipolar see-saw’, centres on the role of a 
large-scale ocean circulation in the Atlantic 
Ocean called the Atlantic meridional over- 
turning circulation (AMOC). The AMOC 
carries heat from the Southern Hemisphere 
northwards until increasing salinity and cool- 
ing cause it to sink at high northern latitudes, 
forming North Atlantic Deep Water, which 
returns southwards at depth. Strengthening 
of the AMOC leads to warm temperatures in 
the North Atlantic and cool temperatures in 
the South Atlantic, with the converse occur- 
ring when the AMOC weakens. 

This coupling might be expected to pro- 
duce a simple inverse pattern of Antarctic and 
Greenland temperatures, but that is not the 


Greenland 
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case. The large thermal inertia of the South- 
ern Ocean provides a heat reservoir in the 
south that accumulates the heat flow from the 
north. Including this effect in the bipolar-see- 
saw hypothesis gives a simple thermodynamic 
model that provides the basis for our under- 
standing of the observations’. This model 
reveals that the abrupt DO warm events dur- 
ing the last glacial period reflect times when 
the AMOC intensified and was accompanied 
by heat removal from the south and cooling 
trends in Antarctica. Conversely, cold periods 
in Greenland and the north correspond to 
reduced AMOC heat transport from the south, 
accompanied by Antarctic warming trends. 

This picture leaves questions of cause and 
effect unanswered, however. Potential driv- 
ers of AMOC variations include changes in 
freshwater balance, sea ice or ice shelves, any 
of which can initially affect deep-water for- 
mation in the north or the south. Buizert and 
colleagues suggest a straightforward interpre- 
tation of the north-south lag: the Northern 
Hemisphere AMOC shift triggers changes 
in the Southern Hemisphere, rather than the 
other way around. But, as others have noted®” 
and the authors recognize, the concept of a 
‘trigger’ may be poorly framed in a system of 
closely coupled oscillations — the system may 
be preconditioned for a state change by remote, 
as yet unidentified factors. 

Buizert et al. find that the Antarctic tem- 
perature maxima occur, on average, 218 +92 
years after DO warming transitions, and that 
the Antarctic minima lag behind the respec- 
tive DO ‘cold’ transition by 208 + 96 years. 
The size of the time delays is inter- 
esting for two reasons. First, they 
are much longer than would be 
expected if atmospheric processes 
were responsible, pointing to an 
oceanic coupling mechanism. 
And second, the delays are the 
same whether the AMOC intensi- 
> fies (the north warms) or reduces 


Antarctic Isotopic Maxima’ (AIM). 
It has been difficult, however, to 
constrain the relative timing or 
phasing of the north-south events 
to better than a few centuries, so the 
presence of leads and lags, or of a 
directionality to the north-south 
coupling, has remained the subject 


> 


Temperature proxy 


218492 years 


i 


208 + 96 years 


Antarctica 


(the north cools). Both of these 
observations should provide new, 
substantial constraints on the pro- 
cesses responsible for transporting 
heat into the Southern Ocean and 
across the barrier of the Antarctic 
Circumpolar Current. 


of speculation — given the errors, 
the changes seemed synchronous. 
On page 661 of this issue, Buizert 
et al.’ (WAIS Divide Project Mem- 
bers) report a clear Antarctic lag 
of around two centuries by using a 
new ice core from the West Antarc- 
tic Ice Sheet (WAIS) Divide. This 
establishes both a north-to-south 


Time 


Figure 1 Paired warming events in Antarctica and Greenland. Antarctica 
warms when there are cool phases in Greenland, and cools when Greenland 
is in its warm state, as recorded by proxies based on isotopic abundances of 
water trapped in ice cores. Buizert et al.” find that Antarctic maxima occur 
on average 218 + 92 years after abrupt warming in Greenland, whereas 
minima lag behind the cool state of Greenland by 208 + 96 years. The 
graphs depict idealized trends, and are not based on recorded data. 
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The findings from the WAIS 
Divide ice core approach the limits 
of such studies in determining the 
north-south delay. The errors are 
dominated by intrinsic uncertain- 
ties in working out the age of air 
trapped in the ice, and in detect- 
ing trend changes of ice-isotope 
abundances, which vary slowly and 


suffer from climate-related noise. The authors 
averaged several AIM events to reduce noise, 
but although obtaining an average delay is 
informative, the question remains as to what 
extent individual DO-AIM pairs might vary 
in timing. 

Studies of other Antarctic ice cores*® reveal 
that the variation of temperature with time 
during AIM events has geographic variability, 
and that two phases are typically visible in the 
AIM profile during warming. They are accom- 
panied by variations in other climate tracers 
that point to atmospheric-circulation changes 
that are not synchronous with Greenland 
events. Buizert et al. looked at atmospheric 
changes, and observed sea-salt variations dur- 
ing AIM that indicate synchronous changes in 
sea ice and temperature. Such variations may 
provide insight into changes in southern fresh- 
water forcing and ocean feedbacks that affect 
the bipolar see-saw. 

Establishing the relative timing between 
see-saw events in the two hemispheres is a big 
step forward, but the full extent of changes 
revealed by Antarctic ice cores, including the 
timing of changes in carbon dioxide level, 


MATERIALS SCIENCE 


remains under-exploited. An integrated under- 
standing of hemispheric climate coupling 
therefore awaits. Nevertheless, Buizert and 
colleagues’ findings are particularly compel- 
ling in the light of recent indications’ of a con- 
temporary slowing of the AMOC, which has 
been anticipated. Predicting the global effects 
of sucha change will pivot on our understand- 
ing of how the hemispheres communicate. = 
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Semiconductors grown 
large and thin 


Atomically thin layers of semiconductors called transition-metal dichalcogenides 
have been grown uniformly on the square-centimetre scale — paving the way for 
the ultimate miniaturization of electronic applications. SEE LETTER P.656 


TOBIN J. MARKS & MARK C. HERSAM 


he ubiquity of electronic devices today 
Tesi from the development of semi- 

conductor wafers that have exceptional 
spatial uniformity. These wafers enable the 
production of highly integrated circuits, 
because each of the billions of constituent 
transistors behaves predictably, with differ- 
ences between individual devices being among 
the smallest of any manufactured technology. 
Meanwhile, the miniaturization of transistors 
over the years has led researchers to consider 
the ultimate size limit: atomic-scale electronic 
devices. This limit has been reached in research 
laboratories with the fabrication of prototypes 
from atomically thin semiconducting materi- 
als’. But integrated circuits can be made from 
such devices only if atomically thin materi- 
als can be grown uniformly over large areas. 
On page 656 of this issue, Kang et al.” report 
a crucial step in this direction. They have 
achieved such uniformity on the wafer-scale 
— several square centimetres — for one of the 


most promising classes of two-dimensional 
semiconductor. 

Transition-metal dichalcogenides (TMDs) 
have the general formula MX,, in which M is 
a metal such as molybdenum (Mo) or tung- 
sten (W), and X can be sulfur, selenium or 
tellurium. They are semiconducting materi- 
als with 2D structures consisting of stacked, 
three-atom-thick X-M-X monolayers bound 
largely by interlayer van der Waals forces, 
roughly analogous to the structure of graphite. 
The appeal of TMDs has conventionally 
centred on their bulk forms’, for applications 
as lubricants, energy-storage materials and 
catalysts. However, intense interest in the 
properties of atomically thin, 2D electronic 
materials such as graphene has extended to 
MX, materials, because they offer the possi- 
bility of high-performance and mechanically 
flexible transistors, light detectors, solar cells 
and light-emitting devices. 

Most studies that have attempted to fabri- 
cate devices from atomically thin TMDs have 
used mechanically exfoliated forms (samples 
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50 Years Ago 


In an article in a recent issue of 
Minerva ... on “The President’s 
Science Advisers’, Dr. P. H. 
Abelson discusses the use which 
the Presidents of the United 

States have made of the service 

of a ‘Science Adviser’ since 1957. 
Dr. Abelson is concerned with the 
part which the Science Adviser 
and his staff have actually played, 
including his relation to the 

Office of Science and Technology 
created in March 1962, and more 
particularly he directs attention 

to some of the limitations of the 
system ... Dr. Abelson goes so far 
as to maintain that the Science 
Adviser and his staff have failed to 
address themselves to many major 
problems which might be expected 
to fall within the Science Adviser’s 
responsibility. Instead he believes 
they have been occupied with many 
relatively trivial problems, and 

the consequent exclusion of such 
questions as whether in the United 
States the present allocations of 
money and manpower has led to 
some discontent with the advisory 
system. 

From Nature 1 May 1965 


100 Years Ago 


The great consumption of petrol as 
a motor fuel, which last year, 

in spite of the disturbing element 
of war, rose to the enormous 
volume of 120 million gallons in 
England, and to nearly ten times 
that amount in America, has led to 
the attempt being made to add to 
the natural supply by the so called 
“cracking” of the heavy residual 
oils left after the petrol and the 
lamp oil have been distilled off 
from the crude oil ... The term 
“cracking” is one of those delightful 
Americanisms which express so 
exactly the meaning we wish to 
impart that it has been adopted 
universally. 

From Nature 29April 1915 
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S(CH,CH,),. 


Figure 1 | Large-area growth of atomically thin, layered semiconductors. 
a, Kang et al.’ report the fabrication of electronically uniform monolayers 
of semiconductors, MS, (M is molybdenum or tungsten), on the square- 
centimetre scale on silicon wafers coated with silica (SiO,). The process 


prepared by stripping off a few atomic layers 
from the bulk material, for example by using 
self-adhesive tape)’. Alternatively, irregularly 
shaped monolayers have been used, or frag- 
ments a few layers thick on electrically insu- 
lating dielectric substrates*. The next step 
towards large-scale manufacturing will require 
practical pathways for coating wafer-scale 
areas with structurally and electronically uni- 
form MX, monolayers on diverse substrates. 

With this goal in mind, the methods of 
chemical vapour deposition (CVD) and 
metal-organic chemical vapour deposition 
(MOCVD) offer the ability to produce large- 
area coatings without the need for the expen- 
sive vacuum equipment required for other 
approaches. These techniques typically use 
volatile precursors to deliver the chemicals 
required to coat heated substrates, sometimes 
with extra energy provided in the form of a 
plasma or light. The practicality of CVD and 
MOCVD processes has led to their use ona 
vast scale for growing coatings on substrates 
as diverse as continuous glass sheets, drill bits 
and optoelectronic devices. 

So far, MX, growth using these techniques 
has focused largely on thick, low-friction 
coatings, often with toxic and corrosive MF, 
precursors”® (F is fluorine) and highly toxic 
hydrogen sulfide (H,S) as a source of sulfur. 
Related approaches include aerosol-assisted 
MOCVD’ and passing H,S or sulfur vapour 
over hot metallic films or metal oxides (MO;) 
pre-deposited on a substrate*”. However, it 
seems unlikely that structurally and electroni- 
cally uniform MX, monolayers can be reliably 
grown over large areas by any of the above 
approaches. 

By contrast, Kang et al. describe an MOCVD 
process for growing MoS, and WS, mono- 
layers that have excellent spatial uniformity on 
silicon wafers coated with silica (SiO,). The 
process uses commercially available, vola- 
tile and relatively safe metal hexacarbonyl 
compounds, Mo(CO), and W(CO),, as 
precursors of molybdenum and tungsten, 
respectively, and diethylsulfide (S(CH,CH,),) 
as a sulfur precursor (Fig. 1). The authors also 
added hydrogen gas to the precursor stream 
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* 


M(CO), 


550 °C, H, 
26 hours 


to remove carbonaceous deposits that form 
during the growth process. 

In this way, Kang and colleagues prepared 
MoS, monolayers across square-centimetre 
areas, and used them to make arrays of micro- 
scopic devices called field-effect transistors 
with a 99% device yield — only 2 out of 200 
transistors characterized by the authors did not 
conduct. The electron mobility of the devices 
at room temperature was 30 square centi- 
metres per volt per second — good for TMDs 
— and the mobility was minimally dependent 
on the dimensions of the transistors or their 
location on the wafer. The authors went on to 
demonstrate that silica can be deposited on 
the TMD monolayer after the first MOCVD 
growth step, and that the MOCVD process can 
then be repeated, resulting in multiple electri- 
cally isolated layers of atomically thin TMDs. 
Three-dimensional electronic architectures 
can thus be made, offering the enticing pos- 
sibility of fabricating ultrahigh-density circuits 
— something that has not been possible using 
conventional silicon electronics. 

Although the new findings constitute a 
substantial advance for atomically thin semi- 
conductors, major issues must still be addressed 
before practical applications are possible. For 
example, the optimum growth conditions 
require a temperature of 550°C to be main- 
tained for 26 hours. This is too high a tempera- 
ture to be used with currently available, flexible 
plastic substrates, and so a lower-temperature 
process is required, or a method for transfer- 
ring large-area, atomically thin layers of TMDs 
from the growth substrate without introducing 
contamination, defects or wrinkles. The long 
growth time will probably also be an issue for 
high-throughput manufacturing. 

Although the observed electron mobili- 
ties are respectable for TMDs, they are still 
more than ten times smaller than those of 
bulk crystalline silicon. Further research 
is also needed to find ways of ‘doping’ the 
TMDs — for example, by adding trace quan- 
tities of impurities — to control the type and 
concentration of charge carriers. Finally, 
the threshold voltage of the devices must be 
tuned to allow the development of low-power 
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Monolayer MS, 


involves exposing heated wafers to precursor compounds — M(CO), and 
diethylsulfide (S(CH,CH;),) — at high temperatures in the presence of 
hydrogen gas. Sulfur, yellow; metal atoms, blue; oxygen, red; carbon, dark 
grey; hydrogen, light grey. Wafers are not shown. 


electronic architectures, such as the widely used 
complementary metal-oxide-semiconductor 
(CMOS) technology. 

The authors’ 2D semiconductors open up 
opportunities for making devices and circuits 
that are far beyond conventional field-effect 
transistors. For example, because monolayer 
TMDs are atomically thin, they do not fully 
screen electric fields applied perpendicular to 
the monolayer; this property might allow the 
development of ‘gate-tunable’ heterojunction 
diodes for application in high-speed com- 
munications circuits’’. Moreover, extended 
defects, such as grain boundaries in monolayer 
TMDs, can be manipulated using applied 
voltages. This might lead to the production 
of gate-tunable memristor devices, which are 
promising building blocks for ‘non-volatile’ 
computer memory and for neuromorphic 
(brain-like) circuit architectures'’. Widespread 
access to uniform, large-area monolayer 
TMDs will help to accelerate progress in these 
emerging areas, allowing the full potential of 
2D semiconductors to be quickly explored. = 
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A multilevel multimodal circuit enhances 


action selection in Drosophila 


Tomoko Ohyama'*, Casey M. Schneider-Mizell!*, Richard D. Fetter’, Javier Valdes Aleman’, Romain Franconville’, 
Marta Rivera- Alba‘, Brett D. Mensh’, Kristin M. Branson’, Julie H. Simpson’, James W. Truman, Albert Cardona!’ & Marta Zlatic'8 


Natural events present multiple types of sensory cues, each detected by a specialized sensory modality. Combining 
information from several modalities is essential for the selection of appropriate actions. Key to understanding 
multimodal computations is determining the structural patterns of multimodal convergence and how these patterns 
contribute to behaviour. Modalities could converge early, late or at multiple levels in the sensory processing hierarchy. 
Here we show that combining mechanosensory and nociceptive cues synergistically enhances the selection of the fastest 
mode of escape locomotion in Drosophila larvae. In an electron microscopy volume that spans the entire insect nervous 
system, we reconstructed the multisensory circuit supporting the synergy, spanning multiple levels of the sensory 
processing hierarchy. The wiring diagram revealed a complex multilevel multimodal convergence architecture. Using 
behavioural and physiological studies, we identified functionally connected circuit nodes that trigger the fastest 
locomotor mode, and others that facilitate it, and we provide evidence that multiple levels of multimodal integration 
contribute to escape mode selection. We propose that the multilevel multimodal convergence architecture may be a 
general feature of multisensory circuits enabling complex input-output functions and selective tuning to ecologically 


relevant combinations of cues. 


A single natural event, such as the attack of a predator, often presents 
multiple types of sensory cues, for example, visual, olfactory, tactile 
and noxious. These cues are detected separately by distinct sensors 
highly specialized for a single type of cue. Different sensory channels 
thus provide independent estimates of the same event that can be 
combined to improve sensitivity and reduce ambiguity’*. Indeed, 
across the animal kingdom, multi-cue events often elicit enhanced 
reactions compared to single-cue stimuli’ ®. 

The enhancement in reactions to multi-cue stimuli is a result of 
convergence and integration of information from multiple modalit- 
ies**°, Because neurons can compute non-linear functions of their 
inputs, where convergence happens, and how often, determines the 
repertoire of input-output functions’. In artificial neural networks, a 
single layer of nonlinear units has strict restrictions in possible res- 
ponses to combinations of inputs*?. Adding a second layer that pro- 
cesses combinations of outputs from the first layer enables the 
network to implement far more complex input-output functions*”° 
and thus to discriminate far better between different combinations of 
inputs. Similarly, in the nervous system, an architecture in which 
neurons that receive different combinations of inputs from two mod- 
alities in one level converge on neurons in downstream levels could 
enable better discrimination between different kinds of multimodal 
stimuli and selective tuning to specific ecologically relevant combina- 
tions of the two cues. 

The structural and functional architecture of multisensory conver- 
gence across multiple levels of the sensory processing hierarchy is an 
open question for any large nervous system. Neurons that integrate 
information from multiple modalities have been most extensively 
studied in higher-order and motor-control areas*°"''’, Multi 
sensory interactions have also been described at earlier stages of sensory 
processing’’**'. However, it is unclear whether the multisensory 


neurons observed at distinct levels are structurally or functionally con- 
nected, and whether they contribute to the same behaviour. 

The Drosophila larva is an excellent model system to study the 
structural and functional patterns of multimodal interactions across 
multiple levels of the sensory processing hierarchy and how these 
patterns contribute to behaviour. The relatively small size of its 
10,000-neuron nervous system makes it amenable to circuit mapping 
using large scale electron microscopy reconstruction”. Powerful gen- 
etic tools enable functional studies of uniquely identifiable neurons*”*. 

Larvae have distinct sensory modalities with which they can gauge 
the degree of threat””*' and an array of distinct escape behaviours 
available for responding to threats, such as turning, digging, fast 
crawling and rolling’’’*****. The most vigorous escape sequence con- 
sists of two locomotion modes: rolling followed by fast crawling”*”’. 
Rolling is the fastest—and presumably the most energetically 
costly””*. A range of distinct aversive cues can evoke the fast crawling 
response without rolling, including mild nociceptive and strong 
mechanosensory stimulation®”*. In contrast, rolling is observed in 
extremely threatening situations, such as strong noxious stimu- 
lation’”**?73°6, or predator attack’”*’, the latter of which provides 
multiple cues, including nociceptive and mechanosensory””*"””. 

Here we show that mechanosensory combined with nociceptive 
cues synergistically increase the likelihood of rolling, and use this 
tractable model system to determine the anatomical and functional 
patterns of convergence supporting the synergy. 


Multisensory cues enhance the selection of rolling 

We developed an automated multisensory assay in which we activate 
larval nociceptive multidendritic class IV (MD IV) neurons thermo- 
genetically*”°* and larval mechanosensory chordotonal neurons using 
vibration®**' (Fig. 1a, b and Extended Data Fig. la-d). 


1Howard Hughes Medical Institute Janelia Research Campus, 19700 Helix Drive, Ashburn, Virginia 20147, USA. 
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§These authors jointly supervised this work. 
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Figure 1 | Mechanical and nociceptive stimulation synergistically trigger 
rolling. a, Schematic of a high-throughput multisensory assay for presenting 
stimuli similar to wasp attack. Speaker-emitted vibration (green) sensed by 
chordotonal (Ch) neurons replaces the mechanosensory stimulus. 
Thermogenetic activation of MD IV neurons (orange) using the transgenic 
temperature-sensitive channel, dTRPA1, and exposing MD IV-GAL4>UAS- 
dTrpA] larvae to warmth (30°C or 32 °C), replaces the noxious sting. 

b, Chordotonal and MD IV neurons from each hemisegment project to distinct 
modality-specific ventrolateral and ventromedial regions of the nerve cord, 
respectively. A, anterior; P, posterior; D, dorsal; V, ventral. See Supplementary 
Table 1 for n (animals) in this and all other figures. See Supplementary Table 2 
for a list of genotypes used in this and all other figures. c, d, Percentage of 
animals that rolled in response to mechanical (green, MD IV-GAL4/+), 
nociceptive (orange, MD IV-GAL4>UAS-dTrpA1), combined (black, MD IV- 
GAL4>UAS-dTrpA1), or no (white, MD IV-GAL4/+) stimulation, when either 
MD IV activation is increased, given constant vibration (1,000 Hz, 6.7 m s 75), 
or vibration is increased, given constant nociceptor activation (32 °C; d). 
Error bars, 95% confidence interval; **P < 0.0001, ¢ test with Holm- 
Bonferroni correction. e, Hyperpolarizing chordotonal neurons reduced rolling 
in response to bimodal mechanical and nociceptive stimulation (32 °C and 
1,000 Hz, 6.7ms * in this and subsequent figures) using the transgenic inward 
rectifying potassium channel Kir in MD IV-LexA>lexAop-dTRPA1, ch- 
GAL4>UAS-Kir animals. Ctrl, controls. 


Different combinations of nociceptive and mechanosensory stimu- 
lation induced different likelihoods of the key escape sequences: rolling 
followed by fast crawling versus fast crawling alone. Nociceptor activa- 
tion alone evoked a relatively low likelihood of rolling (Fig. 1c, d, 
Extended Data Fig. le and Supplementary Video 1) and a high like- 
lihood of fast crawling” (Extended Data Fig. le, f). Vibration alone 
evoked only fast crawling’**”* (Extended Data Fig. le, f) and essentially 
no rolling (Fig. 1c, d and Supplementary Video 2). 

Combined with nociceptor activation, vibration increased the like- 
lihood of rolling; the effect is dose-dependent and super-additive 
(synergistic) (Fig. 1c, d, Extended Data Fig. 1g and Supplementary 
Video 3). This vibration-induced facilitation of rolling was mediated 
through the mechanosensory chordotonal neurons (Fig. le). 


First-order multisensory interneurons trigger rolling 

We suspected that the information from the two modalities converges 
onto central neurons involved in the selection of rolling. To identify 
such neurons and thus determine where in the sensory processing 
hierarchy multisensory convergence occurs, we performed a beha- 
vioural screen for neurons whose thermogenetic activation triggers 
rolling (see Methods). We identified a ‘hit’, the R72F11 Drosophila 
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line, that drove GAL4 expression in neurons potentially early in the 
sensory processing hierarchy (Fig. 2a—-c and Extended Data Fig. 2a, b). 
Activating the neurons in R72F11 triggered rolling in a significant 
fraction of animals (Fig. 2a and Supplementary Videos 4 and 5), 
and inhibiting them (see Methods) significantly decreased rolling in 
response to bimodal stimulation (Fig. 2b). 

R72F11 drives expression selectively in four lineage-related, segmen- 
tally repeated projection neurons with basin-shaped arbors in the 
ventral, sensory domain of the nerve cord; we therefore named them 
Basins-1-4 (Fig. 2c, d and Extended Data Fig. 2a—c). The dendrites of 
Basin-1 and Basin-3 span a ventrolateral domain of the nerve cord, 
where the mechanosensory chordotonal terminals are located*”** 
(Fig. 2c, d and Extended Data Fig. 2c, d). The dendrites of Basin-2 
and Basin-4 span both the ventrolateral chordotonal domain and a 
ventromedial domain where the nociceptive MD IV terminals are 
located*”~° (Fig. 2c, d and Extended Data Fig. 2c, d). We therefore asked 
whether the mechanosensory chordotonal and the nociceptive MD IV 
neurons directly converge on Basin-2 and Basin-4. 

In an electron microscopy volume that spans 1.5 nerve cord seg- 
ments, we reconstructed the chordotonal and MD IV arbors (Fig. 2e 
and Extended Data Fig. 2e, f) and all of their synaptic partners (a total 
of 301 neuronal arbors) (Fig. 2e, Extended Data Fig. 3a-d and 
Supplementary Table 4a—c). Amongst the reconstructed neurons we 
recognized the left and right Basin-1, -2, -3 and -4 homologues 
(Fig. 2d, f and Extended Data Fig. 2c, d, g, h). 

Basin-1 and Basin-3 received many inputs (each >25 synapses and 
>15% of total input, on average) from chordotonal neurons, but very 
few (no more than 1% of total input synapses) from MD IV neurons 
(Fig. 2g and Extended Data Fig. 2i, j). Basin-2 and Basin-4 received 
many inputs from both chordotonal neurons (on average >20 
synapses and >10% total input) and MD IV neurons (on average 
>20 synapses and >10% total input) (Fig. 2g and Extended Data 
Fig. 2i, j), each on distinct dendritic branches (Fig. 2h). Of all the 
301 partners downstream of MD IV and chordotonal neurons, only 
Basin-2 and Basin-4 reproducibly received >5 synapses from both 
chordotonal and MD IV neurons (Fig. 2i), suggesting that they are 
probably key integrators of chordotonal and MD IV inputs. 

To investigate whether the observed anatomical inputs from the 
sensory neurons onto Basins were functional and excitatory, we 
imaged calcium transients in response to MD IV or chordotonal 
activation (see Methods) collectively in all Basins (Extended Data 
Fig. 4a—-d) or in individual Basin types (Fig. 3a, b), using lines that 
drive expression selectively in Basin-1 or Basin-4 neurons (Extended 
Data Fig. 4e-h). 

In Basin-1, we observed calcium transients in response to vibration, 
but not in response to MD IV activation (Fig. 3a), consistent with the 
large number of synapses it receives from chordotonal neurons and 
the relatively few from MD IV neurons (Fig. 2g and Extended Data 
Fig. 2i). In Basin-4, we observed calcium transients in response to both 
vibration and MD IV activation (Fig. 3b), consistent with the large 
number of synapses it receives from both sensory types (Fig. 2g and 
Extended Data Fig. 2i). Basin-4 integrated the inputs from the two 
modalities, responding significantly more to bimodal than to unim- 
odal stimulation (Fig. 3b, c and control in Extended Data Fig. 4i, j). 

Next, we asked whether the multisensory Basin-4 interneurons 
contribute to rolling selection. Silencing Basin-4 neurons significantly 
decreased rolling in response to bimodal stimulation, indicating these 
neurons are involved in triggering rolling (Fig. 3d). Selective activa- 
tion of the multisensory Basin-4 interneurons triggered rolling in a 
dose-dependent way, with strongest activation triggering rolling in 
45% of animals (Fig. 3e). 

Wealso wondered whether a second level of multimodal integration 
(that is, integration of information from distinct Basin types, that 
receive distinct combinations of chordotonal and MD IV inputs), 
enhances the selection of rolling. Indeed, co-activation of Basin-1 
with the bimodal Basin-4 facilitated rolling (Fig. 3e), resulting in a 
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Figure 2 | Activation screen and electron microscopy reconstruction reveal 
first-order multisensory interneurons. a, Thermogenetic activation of 
neurons in R72F11 (exposing R72F11-GAL4>UAS-dTrpA_1 larvae to 32 °C) 
triggered rolling. Error bars, 95% confidence interval; **P < 0.0001, ¢ test 
with Holm-Bonferroni correction. b, Silencing neurons in R72F11 significantly 
reduced rolling in response to bimodal stimulation (in MD IV-LexA>lexAop- 
dTRPA1, R72F11-GAL4>UAS-Kir animals). Ch, chordotonal. c, R72F11 drives 
expression in Basin neurons (green, R72F11-GAL4>UAS-GFP). Scale bar, 

25 um. Arrowheads indicate midline. d, Four Basin types visualized with a FLP- 
based strategy are identifiable based on unique combinations of dendritic 
features: medial projection (yellow arrowhead); ventrolateral indentation 
(orange arrowhead); bushy dorsal arbor (red arrowhead); and absence of 
medial projection (white arrowhead). Solid white outlines, neuropil boundary; 


significantly higher likelihood of rolling compared to activation of 
Basin-4 alone (70% versus 45%). Thus, information from distinct 
Basin types may converge again onto downstream neurons involved 
in triggering rolling. 


Goro neurons in motor nerve cord trigger rolling 


To identify potential sites of convergence of information from the 
different types of first-order Basin interneurons we looked at a ‘hit’ 
from the thermogenetic activation screen, R69F06, a Drosophila line 
that drove GAL4 expression in neurons that project far from the early 
sensory processing centres (Fig. 4a-c). Thermogenetic activation of 
neurons in R69F06 triggered rolling in a high fraction of larvae (Fig. 4a 
and Supplementary Video 6), and inhibiting them significantly 
decreased rolling in response to bimodal stimulation (Fig. 4b). 

R69F06 drives expression in a few neurons in the brain, in the sub- 
oesophageal zone (SEZ) and in a pair of thoracic neurons whose 
axons descend through the dorsal, motor domain of the nerve cord 
(Fig. 4c—e). Selectively activating the single pair of thoracic neurons 
(see Methods and Extended Data Fig. 5a—d) triggered rolling in 76% 
of larvae (Fig. 4e, f). We named these command-like neurons Goro 
(a romanization of the Japanese for rolling). 

Activation of Basins evoked strong calcium transients in the Goro 
neurons, indicating that these cell types involved in the same behaviour 
are functionally connected (Fig. 4g, h and Extended Data Fig. 5e, f). 
To identify the shortest anatomical pathways from Basins to Goro that 
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dashed lines, midline. Neuropil is visualised with labelling against N-cadherin 
(N-cad) in c or neuroglian (Ngn) in d. Scale bar, 20 jum. e, Electron microscopy 
reconstruction of all direct synaptic targets of the chordotonal (eight left and 
eight right) and MD IV (three left and three right) neurons in the abdominal 
segment A3. ssTEM, serial section transmission electron microscopy. 

f, Amongst the reconstructed arbors, we recognized Basins, based on 
morphological features (arrowheads). Grey arrow, midline. Scale bar, 10 um. 
g, The fraction of inputs onto Basins originating from chordotonal or MD IV 
terminals. h, Synaptic inputs from MD IV (orange circles) and chordotonal 
(green circles) segregate onto distinct dendritic branches of Basin-2 and Basin- 
4. i, Plot shows the number of synapses from chordotonal and MD IV terminals 
onto each of the 301 downstream partners. L, Basin homologue from the left 
hemisegment; R, Basin homologue from the right hemisegment. 


might support the observed functional connectivity and to determine 
whether the information from distinct Basin types converges onto 
Goro, we again used electron microscopy reconstruction. 


Nerve cord and brain pathways from Basins to Goro 
We used a second electron microscopy volume (from a second larva) 
that spans the entire larval nervous system and therefore also includes 
Goro neurons (Fig. 5a and Supplementary Video 7). In the new volume, 
we reconstructed chordotonal, MD IV and Basin neurons from seg- 
ment A1, as well as the Goro neurons (Fig. 5a—c, Extended Data Figs 2g— 
j and 3a-d, Supplementary Table 5a—c and Supplementary Video 8). 
To find putative pathways from distinct Basin types to Goro neu- 
rons, we first reconstructed all neurons downstream of all axonal 
outputs from the four left and right Basin homologues from segment 
Al (Fig. 5d, e, Supplementary Information (Supplementary Atlas 
section) and Supplementary Tables 5 and 6). We identified 31 pairs 
of reproducible downstream partners (Fig. 5d, e and Supplementary 
Information (Supplementary Atlas section)). Amongst these we iden- 
tified second-order nerve cord interneurons that constitute the short- 
est pathways from Basins to Goro neurons (called A05q and A23g 
where ‘A’ stands for abdominal neuron; Fig. 5d). They receive inputs 
from distinct Basin types and synapse onto Goro neurons (Fig. 5d). 
Thus, information from distinct Basin types, that receive distinct 
combinations of MD IV and chordotonal inputs, converges onto 
Goro neurons—providing a second level of multimodal convergence. 
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Figure 3 | Multiple levels of multimodal integration. a, Calcium transients 
(mean + s.e.m.) are evoked in Basin-1 dendrites by vibration (1,000 Hz, 
6.7ms 7,58, symbolized by green speaker; green trace; n = 7 trials), but not by 
MD IV activation with transgenic ATP-gated channel, P2X3, and local injection 
of ATP (symbolized by orange injection electrode; orange trace; n = 6 trials; 
MD IV-lexA>laxAop-P2X; Basin-1-GAL4>UAS-GCaMP6). b, Calcium 
transients (mean + s.e.m.) are evoked in Basin-4 dendrites by vibration (green 
trace), nociception (orange trace) and combined stimulation (black trace; n = 7 
triplet trials; MD IV>P2X,; Basin-4>GCaMP6); but not by ATP injection in 
the absence of P2X> expression (blue trace, MD IV>+; Basin-4>GCaMP6). 
c, Peak AF/Fy from e shows Basin-4 response to combined stimulation (filled 
black circles) is significantly higher (paired t-test) than the calculated sum of 
unisensory responses (clear black circles). d, Silencing Basin-4 neurons 
significantly reduced rolling evoked by bimodal stimulation (in MD IV- 
LexA>lexAop-dTRPAI, Basin-4-GAL4>UAS-Kir animals) compared to 
controls. Ch, chordotonal. e, Thermogenetic activation of Basin-4 neurons 
evoked rolling (exposing Basin-4-GAL4>UAS-dTrpA1 animals to 30°C or to 
32 °C). Basin-1 facilitated rolling in combination with Basin-4 (exposing 
Basin-4-GAL4, Basin-1-GAL4>UAS-dTrpA1 animals to 30°C or to 32 °C). 
Error bars, 95% confidence interval; **P < 0.0001, x? test with 
Holm-Bonferroni correction. 


Ten distinct second-order projection neuron types downstream of 
Basins ascend to the brain (Fig. 5e and Supplementary Information 
(Supplementary Atlas section)). Some of these integrate Basin 
information across multiple distal segments of the body, either exclu- 
sively from a single Basin type (for example, A00c-a6, Fig. 5e), or from 
distinct Basin types (that receive distinct combinations of sensory 
inputs; for example, A00c-a4 and A00c-a5, Fig. 5e). Then, distinct 
second-order PNs, that receive distinct combinations of Basin inputs 
(and therefore distinct combinations of mechanosensory and noci- 
ceptive inputs), re-converge again on third-order interneurons in the 
brain (Fig. 5f). Thus, following convergence of local mechanosensory 
and nociceptive information from a single segment onto multisensory 
Basins, global mechanosensory and multisensory information from 
multiple segments is integrated within the brain pathway. 

By tracing upstream of Goro dendritic inputs (Fig. 5f and 
Supplementary Atlas) we identified brain neurons which send 
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Figure 4 | Basins are functionally connected to command-like Goro 
neurons. a, Thermogenetic activation of neurons in R69F06 triggered rolling 
(exposing R69F06-GAL4>UAS-dTrpA1 to 32 °C). b, Silencing neurons in 
R69FO06 significantly reduced rolling in response to bimodal stimulation (MD 
IV-LexA>lexAop-dTRPA1, R69F06-GAL4>UAS-Kir). Ch, chordotonal. 

c-e, Images of neurons (green) in R69F06>GFP (c), R69F06,tsh-Gal80>GEP 
(d) and Goro>dTRPA1-myc (R16F11-LexA>lexAop-FLP, R69F06-GAL4>UAS- 
FRT-STOP-FRT-dTRPA1-myc) (e) animals. Scale bar, 25 tm. Arrowheads 
indicate midline. f, Thermogenetic activation of the single pair of Goro neurons 
triggered rolling in 76% of animals (exposing Goro>dTRPA1-myc from the 
RI6F11-LexA>lexAop-FLP, R69F06-GAL4>UAS-FRT-STOP-FRT-dTRPA-myc 
cross to 32 °C); thermogenetic activation of only the brain and SEZ neurons in 
R69FO06 did not (exposing R69F06, tsh-GAL80>dTRPAI to 32 °C). Error bars, 
95% confidence interval; **P < 0.0001, ¢ test with Holm-Bonferroni correction. 
g, Basin activation by local ATP injection (in 72F11>P2X>, R69F06>GCaMP 
animals) evoked calcium transients (mean + s.e.m.) in Goro dendrites (red trace, 
n= 9 trials); ATP injection in the absence of P2X, (in 72F11>+; 
R69F06>GCaMP animals) did not (grey trace, n = 8 trials). h, Summary of the 
identified functional pathway involved in triggering rolling. 


descending axons that synapse onto Goro neurons. Tracing down- 
stream of a multisensory second-order ascending projection neuron 
(A00c-a4), we identified third-order projection neurons connecting 
the ascending pathways from Basins to a descending path onto Goro 
neurons (Fig. 5f). Thus, the activity of the command-like Goro neu- 
rons may be modulated by the more local multisensory and unisen- 
sory information via the nerve cord Basin—Goro pathway and by the 
global body-wide nociceptive and mechanosensory multisensory 
information via the brain Basin-Goro pathways (Fig. 5d-f). 

We also identified a third-order SEZ feedback neuron that receives 
convergent body-wide mechanosensory and multisensory informa- 
tion and descends through the nerve cord sensory domain (Fig. 5g). 
The SEZ feedback neuron synapses onto the first-order (Basins) and 
second-order neurons from both the nerve cord (A05q) and brain 
(A00c) Basin-Goro pathways (Fig. 5g). Both the nerve cord and brain 
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Figure 5 | Electron microscopy reconstruction 
reveals multiple levels of multimodal 
convergence within Basins-to-Goro pathways. 
a, Electron microscopy reconstruction of the 
Basin-Goro anatomical pathways. ssTEM, serial 
section transmission electron microscopy. 
b, Confocal image (green) and electron 
microscopy (EM) reconstruction of a single Goro 
neuron (magenta; see also Supplementary Video 
= 8). Arrowheads indicate midline. Scale bars, 
é 20 um. ch, Reconstructed interneurons (top) and 
' their connectivity (bottom). MD, motor domain; 
| SD, sensory domain. c, Reconstructed Basins and 
' Goro neurons. d, Mechanosensory Basin target 
A23g and multisensory Basin target A05q 
converge onto Goro neurons. Grey, all other Basin 
targets that project to the nerve cord. e, Distinct 
A00c (yellow) ascending projection neurons 
integrate information from distinct combinations 
of Basins across multiple segments. Grey, all other 
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Basin-Goro pathways may therefore be jointly regulated based on 
integrated global multisensory information. 


Function of distinct pathways in triggering rolling 

Next we wanted to explore the functional role of the nerve cord and 
brain Basin—Goro pathways. Basin activation could activate Goro 
neurons in the absence of the brain, suggesting the nerve cord 
Basin-Goro pathway is excitatory and sufficient for activating Goro 
neurons and triggering rolling (Extended Data Fig. 6a, b). Consistent 
with this idea, Basin activation evoked calcium transients in their 
nerve-cord targets, the A05q neurons, and A05q activation evoked 
calcium transients in Goro neurons, (Extended Data Fig. 6c-g). 
Furthermore, thermogenetic activation of the neurons in a line that 
drives expression, amongst others, in the A05q neurons triggered 
rolling (Extended Data Fig. 6c, h). 

Calcium imaging in the terminals of three Basin-target neurons 
that ascend to the brain (A00c-a6, A00c-a5 and A00c-a4) (Fig. 5h) 
revealed that, collectively, they respond to vibration, to MD IV activa- 
tion, and to Basin activation, suggesting that this connection is also 
excitatory (Fig. 5i and Extended Data Fig. 6i, j). 

Silencing the A00c neurons decreased rolling in response to bimo- 
dal stimulation (Fig. 5j), and their co-activation with Basin-4 facili- 
tated rolling (Fig. 5k). Therefore, downstream from early local 


and nociceptive MD IV 


i, Calcium transients (mean + s.e.m.) in the 
terminals of all A00c cells (in MD IV>P2X3, 
R71A10>GCaMP animals) shows they respond to 
MD IV activation by local ATP injection (orange 
trace, n = 12 trials) and vibration (green trace, 
n= 4 trials). j, Silencing A00c neurons 
significantly reduces rolling in response to 
bimodal stimulation (in MD IV-LexA>lexAop- 
dTRPAI1, R71A10-GAL4>UAS-Kir animals). Ch, 
chordotonal. k, A00c neurons facilitate rolling in 
combination with Basin-4 (exposing Basin-4- 
GAL4, 71A10-GAL4>UAS-dTrpA1 animals to 
30°C or to 32 °C). Error bars, 95% confidence 
interval; **P < 0.0001, x? test with Holm- 
Bonferroni correction. 
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multisensory integration by Basin-4, additional levels of integration 
of global mechanosensory and multisensory information appear to 
further facilitate the transition to rolling behaviour. 


Differential effects on triggering roll and crawl 

The rolling response triggered by multisensory cues (or by strong 
nociceptive cues alone) is followed by fast crawling** (Extended 
Data Fig. le, f). Similarly, optogenetic*® activation of the first-order 
multisensory Basin-4 neurons triggered both locomotor modes; roll- 
ing followed by fast crawling (Extended Data Fig. 7a—c). However, 
optogenetic activation of the Goro neurons triggered only rolling, but 
not fast crawling, suggesting that they act as dedicated command-like 
neurons for rolling (Extended Data Fig. 7a-c). This also suggests 
that the act of rolling itself is insufficient to trigger fast crawling. In 
the future it will be interesting to determine how all of the 31 novel 
neuron types directly downstream of the Basins identified in our 
electron microscopy reconstruction (Supplementary Information 
(Supplementary Atlas section)) contribute to the selection of the 
two locomotor modes, rolling and crawling, in a defined sequence. 


Discussion 


By combining behavioural and physiological studies with large-scale 
electron microscopy reconstruction we mapped a multisensory circuit 
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Figure 6 | Summary of the multilevel multimodal convergence circuit for 
rolling. a, Whole-animal circuit. Ch, chordotonal. b, c, Specific nerve cord 
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that mediates the selection of the fastest mode of escape locomotion 
(rolling) in Drosophila larva (Fig. 6a, b). We find that mechanosen- 
sory and nociceptive sensory neurons converge on specific types 
of first-order multisensory interneurons that integrate their inputs. 
Then, interneurons that receive distinct combinations of mechano- 
sensory and nociceptive inputs converge again at multiple levels 
downstream, all the way to command-like neurons in the nerve cord. 
Activating just a single type of first-order multisensory interneuron 
triggers rolling probabilistically. Co-activation of first-order inter- 
neurons that receive distinct combinations of mechanosensory and 
nociceptive inputs increases rolling probability. Thus, action selection 
starts at the first-order multisensory interneurons and multiple stages 
of multimodal integration in the distributed network enhance this 
selection. 

Given that spurious firing from distinct sensors is uncorrelated, 
whereas event-derived signals will be temporally correlated across 
the sensory channels, multimodal integration even at a single level 
improves the signal-to-noise ratio’**’. Multilevel multimodal integ- 
ration can offer additional advantages. Theoretical studies show that a 
multilevel convergence architecture enables more complex input- 
output relationships’’°. Similarly, the multilevel multimodal conver- 
gence architecture described here could offer better discrimination 
between different kinds of multisensory events. The weights in such 
networks could be tuned either through experience or through evolu- 
tion to respond selectively to highly specific combinations of two cues. 
Using a simple model, we can demonstrate (Extended Data Fig. 8a—k) 
that compared to early-convergence a multilevel architecture could 
specifically enhance the selection of the fastest escape mode in the 
most threatening situations, either in response to weak multimodal or 
strong unimodal nociceptive cues. 

The multilevel multimodal convergence architecture may be a gen- 
eral feature in multisensory integration circuits, enabling complex 
response profiles tunable to specific ecological needs. For example, 
physiological studies in mammals have identified multisensory neu- 
rons that integrate the same cues at several stages in the sensory 
processing hierarchy’*””?!**, although it is unclear whether the mul- 
tisensory neurons at distinct levels are causally related to the same 
behaviour. Due to the size of networks involved, synaptic-level reso- 
lution studies of the underlying convergence architecture across mul- 
tiple levels were unattainable. 


638 | NATURE | VOL 520 | 30 APRIL 2015 


In addition to the multilevel multimodal feed-forward convergence 
motif, our electron microscopy reconstruction revealed higher-order 
and local feedback neurons (Figs 5g and 6a and Supplementary 
Information (Supplementary Atlas section)). Recent theoretical mod- 
els of multisensory integration suggest that the output of individual 
multisensory neurons is normalized by the activity of other multi- 
sensory neurons in that population, but the anatomical implementa- 
tion of such feedback has not been identified***. Some of the feedback 
neurons in the multisensory circuit described in this study may have 
roles in such normalization computations. 

Another circuit motif revealed by our study is the divergence of 
sensory information into nerve cord and ascending brain pathways 
and subsequent re-convergence of the shorter and the longer pathway 
onto the same command-like neurons in motor nerve cord (Goro). 
The nerve cord pathway integrates nociceptive and mechanosensory 
information from a local region of the body (few segments), whereas 
the ascending brain pathway integrates the information across all 
body segments and provides a means of modulating command-like 
neuron activity based on global body-wide nociceptive and mechano- 
sensory information. The multisensory circuit described here in a 
genetically tractable model system provides a resource for investi- 
gating in detail how multiple brain and nerve cord pathways interact 
with each other and contribute to the selection of different modes of 
locomotion (rolling and crawling) in a defined sequence. 

The electron microscopy volume spanning the entire insect nerv- 
ous system acquired for this study can be used to map circuits that 
mediate many different behaviours. Combining information from a 
complete wiring diagram with functional studies has been very fruitful 
in the 302-neuron nervous system of C. elegans***°. Recently, similar 
approaches have been applied to microcircuits in smaller regions of 
larger nervous systems*”°. In this manuscript, we demonstrate that 
relating local and global structure to function in a complete nervous 
system is now possible for the larger and more elaborate nervous 
system of an insect. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Fly strains. In the main text and figures, short names are used to describe geno- 
types for clarity. The complete genotypes of animals used in this study are shown 
in Supplementary Table 2. 

The following strains from the Rubin GAL4/LexA collection were used for the 
figures in the manuscript: R61D08-GAL4, R69F06-GAL4, R72F11-GAL4, 
R20B01-GAL4, R71A10-GAL4, R57F07-GAL4, R47D07-GAL4, R16E11-LexA, 
R61D08-LexA, R72F11-LexA and the control stock w;;attp2 with no GAL4 or 
LexA***". For the thermogenetic activation screen we selected 586 GAL4 lines’® 
that drove expression in relatively few neuron types (one to five) and used them to 
drive expression of TRPA with UAS-dTRPA1**?. 

The following effector stocks were used: pJFRC25-13xLexAop2-IVS-dTrpA1 in 
vk05°°**;  pJFRC48-13xLexAop2-IVS-mrytdTomato in attp18, pJFRC20- 
8xLexAop2-IVS-GAL80-WPRE in attp18°°*; pJFRC29-10xUAS-IVS-myr::GEP- 
p10 in attp2°; pGP-JFRC7-20xUAS-IVS-GCaMP6s* (Bloomington stock no. 
42749); pGP-JFRC7-20xUAS-IVS-GCaMPé6f5 (Bloomington stock no. 52869); 
20xUAS-IVS-GCaMP3 (Bloomington stock no. 32236;°°); and UAS-JFRC12- 
10xUAS-mryGFP (Bloomington stock no. 32197;°!**) were gifts from B. D. 
Pfeiffer, G. Rubin, and the GECI project team (HHMI Janelia Research 
Campus). ppk1.9-GAL4” was a gift from D. Tracey (Duke University). ppk- 
1kb-hs43-lexA-GAD in attp40 was a stock that was previously described in ref. 
34. UAS-dTRPA1* was a gift from P. Garrity. UAS-EGFP-Kir2. 1** was outcrossed 
to w'"”8 and gifted from D. Sitaraman (Yale University). LexAop2-P2X,* was a 
gift from C.-H. Yang (Duke University; unpublished stock). 

For selective targeting the Goro neurons, we used LexAop-FLP; 
UAS>stop>dTrpAl::myc® a gift from B. Dickson, crossed with w’”%;16E11- 
LexA (attp40); R69F06-GAL4 (attp2), which drives expression in Goro neurons. 
The expression was determined by anti-Myc antibody (seen on immunohisto- 
chemistry). 

Tsh-LexA was a gift from J.-M. Knapp and J. Simpson (unpublished stock). In 
brief, the tsh-LexA driver is an enhancer trap inserted into the 5’ UTR of the tsh 
locus. It was generated via a P-element swap that replaced the p{GawB} insertion 
of tsh-GAL4°! with P{UpP65L} and the enhancer-trap LexA construct. Proper 
targeting and orientation of P{UpP65L} were confirmed by splinkerette PCR and 
sequencing by J.-M. Knapp (unpublished results). 

LexAop2-mry-TDTomato-p10 (attp40), a gift from D. Mellert (HHMI Janelia 
Research Campus; unpublished stock), is an mry::TDtomato fragment with 
AcNPV p10**. To selectively target Basin-4 neurons we generated a Split- 
GAL4*° stock R72F11_AD inserted in attp40 (chromosome 2L) and 
57F07_DBD in attp2 (3L). AD and DBD combinations were assembled in a 
w''8 background. We identified R57F07 as a candidate line for this intersection 
because its activation also evoked rolling in the thermogenetic activation screen 
(Extended Data Fig. 4). Stochastic labelling of single cells in R57F07 and R72F11, 
using a FLP-based strategy (see below) revealed they both drive expression in 
Basin-4. R57FO07 also drove expression in other neurons in the tip of the nerve 
cord and brain (Extended Data Fig. 4). 

The ‘FLP-out’ approach® for stochastic single-cell labelling will be described in 

detail elsewhere (Nern et al., manuscript in preparation). In brief, heat-shock 
induced expression of FLP recombinase was used to excise FRT-flanked inter- 
ruption cassettes from UAS reporter constructs carrying HA, V5, and Flag epi- 
tope tags, and stained with epitope-tag specific antibodies. This labelled a subset 
of the cells in the expression pattern with a stochastic combination of the 
three labels. 
Behavioural apparatus. The apparatus comprises a video camera (DALSA 
Falcon 4M30 camera) for monitoring larvae, a ring light illuminator (Cree 
C503B-RCS-CWOZO0AA1 at 624 nm in the red), a computer (see ref. 28 for details; 
available upon request are the bill of materials, schematic diagrams and PCB 
CAM files for the assembly of the apparatus) and two hardware modules for 
controlling vibration and temperature (Oven Industries PA, Model 0805). Both 
hardware modules were controlled through multi worm tracker (MWT) software 
(http://sourceforge.net/projects/mwt), as described in ref. 28. 

The module for presenting vibration consists of a speaker (120 W, 12 inches in 
diameter) and an amplifier (Pyle Pro PCA3) controlled by a signal generator 
(Tektronix AFG3021B). The speaker was used to generate vibrations between 
100 and 1000 Hz, with an electronic signal (1 V, 100 Hz) generating a sound 
pressure level of 122 dB at the assay plate (measured by a Realistic 33-2050 
Sound Level Meter). The frequency of vibration at the agar surface and vibration 
intensity (measured as acceleration in ms ~) was measured with an ADXL345 
3-axis accelerometer assembled onto a small printed circuit board placed on the 
agar as described in ref. 28. 

For thermal activation, we used a Peltier module controlled by a temperature 
controller (Oven Industries PA, Model 0805). An aluminium plate covered with a 
thin layer of 4% charcoal agar was placed on the Peltier module. The temperature 


of the agar was raised from 20°C to 30 °C/32 °C to weakly/strongly activate the 
dTRPA1 channel. By using a temperature gun, we confirmed that the agar 
reached the Peltier tile setting temperature. The time to reach the target temper- 
ature ranged from 3 to 8 s. The behaviour rigs were inside 32.00” wide X 28.00” 
deep X 60.00” high temperature-controlled enclosures with temperature settable 
from ambient to 40°C in 0.1°C steps (Life Science Engineering, Inc.). The 
humidity in the room was monitored and held at 58%, with humidifiers 
(Humidifirst Mist Pac-5 Ultrasonic Humidifier). 

Behavioural experiments. Embryos were collected for 6h at 18°C and 65% 
humidity. Larvae containing the UAS-dTrpA1 or LexA-dTrpA1 transgene were 
raised at 18 °C for 7 days with normal cornmeal food. Foraging third instar larvae 
were used for all experiments. Larvae containing the UAS-CsChrimson transgene 
were grown in the dark at 25°C for 4 days on fly food containing trans-retinal 
(Sigma, R2500) at a concentration of 500 M. 

Before the experiments, the larvae were separated from food by using 15% 
sucrose and washed with water. The larvae were then dried and placed in the 
center of the arena. The substrate for the behavioural experiments was 3% Bacto 
agar gel in a 25 X 25cm square plastic plate for experiments involving thermal 
activation and vibration stimuli, or a 10 X 10 cm plate for those involving thermal 
activation alone. We tested approximately 15-50 larvae at once in the behavioural 
assays. The temperature of the entire rig was kept at 25°C (for optogenetic 
activation experiments) or 30°C or 32°C for thermogenetic activation experi- 
ments. The agar plates were also kept at the desired temperature prior to experi- 
ment. For optogenetic activation experiments we exposed dishes of larvae to red 
light (630 nm, 2.4mW per cm/). Optogenetic activation offers greater temporal 
precision than thermogenetic activation so we used to study the ability of Basin, 
Goro and A00c neurons to trigger the entire behavioural sequence roll-fast crawl, 
or just a single element of the sequence (roll or crawl) 

The MWT software™ (http://sourceforge.net/projects/mwt) was used to record 
all behavioural responses and to control the presentation of vibration stimuli. 

For the thermogenetic activation screen we used 586 GAL4 lines preselected for 
expression in relatively few neuron types (one to five) to drive expression of 
dTRPAI. In each experiment, we exposed dishes of larvae to 32°C to thermo- 
genetically activate dTRPA1-expressing neurons. We detected 32 ‘hits’ that con- 
tained neurons whose thermogenetic activation evoked rolling in a significant 
fraction of animals. We analysed further hits that evoked rolling in the highest 
fraction of larvae and that drove expression in the least neuron types. 
Behaviour quantification. Larvae were tracked in real-time using the MWT 
software. We rejected objects that were tracked for less than 5 seconds or moved 
less than one body length of the larva. For each larva MWT returns a contour, 
spine and centre of mass as a function of time. Raw videos are never stored. From 
the MWT tracking data we computed the key parameters of larval motion, using 
specific choreography (part of the MWT software package) variables”*. From the 
tracking data, we detected and quantified crawling and rolling events and the 
speed of peristaltic crawling strides as described in ref. 28, using the LARA 
software package (http://sourceforge.net/projects/salam-hhmi). We also trained 
a machine-learning classifier, using JAABA (Janelia Automatic Animal Behavior 
Annotator*) to detect rolling events from the tracking data. The false detection 
rate of the JAABA rolling classifier was slightly smaller (2.8%, n = 89) than that of 
the LARA detection software (3.6%), but the results obtained with both methods 
were very similar. The MATLAB code for the rolling classifier is available upon 
request. The data presented is a quantification of rolling events detected with 
JAABA. To calculate the percentage of animals that rolled in response to thermo- 
genetic activation, vibration, or both, we selected for analysis all the animals that 
were tracked throughout the entire 15-second sampling interval following stimu- 
lation. We then calculated the percentage of animals from this sample for which at 
least one rolling event was detected during the 15-second interval. The instant- 
aneous percentage of rolling and crawling larvae (Extended Data Fig. 7a) as a 
function of time was calculated by dividing the number of larvae performing 
rolling at each time point with the total number of larvae tracked at each 
time point. 

Selective targeting of Goro neurons. We expressed FLP selectively in the Goro 
neurons using the weak 16E11 LexA lines (in 16ElexA/LexAop FLP; 
UAS>stop>UAS-dTRPA1::myc/R69F06). Selectively in Goro neurons, FLP 
stochastically excises the STOP cassette that prevents the expression of UAS- 
dTRPA1. Now the strong R69F06 promoter can drive high levels of dTRPA1 
expression, selectively in Goro neurons. Since this method of Goro activation is 
stochastic we monitored rolling in response to 32°C exposure in individual 
larvae. We dissected all animals, noted whether they rolled or did not, and stained 
their nervous systems with anti-Myc, to identify those that had Goro expression. 
Statistical analysis. Statistical analysis was performed using MATLAB 
(MathWorks) software. The calcium responses in the paired trials shown in 
Fig. 4f were compared both using the paired t-test and Wilcoxon signed-rank 
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test. Required sample size for power 0.8 was determined using GPower software 
for paired t-test and for a large pre-specified effect size based on a few pilot trials. 
Indeed the effect size in the data was large (P = 0.016 and d = 1.05 for one-sided 
paired t-test and P = 0.02 and r= 0.81 using Wilcoxon signed-rank test). The 
speed of peristaltic crawling strides was compared using Wilcoxcon rank sum test 
(and Bonferroni correction for multiple comparisons). In all figures, * and ** 
represent P < 0.05 and P< 0.01, respectively. Required sample size for power 0.8 
was determined using GPower software for a medium pre-specified effect 
size. The fractions of animals that rolled under the different experimental con- 
ditions were compared using the x7 test. P values were corrected for multiple 
comparisons using Bonferroni correction. Power analysis was performed as 
described below. 

We performed an analysis of the power of our statistical tests of behavior 
differences, given the sample sizes and effect sizes observed. In addition, we 
computed the minimum effect sizes our analyses would have been sensitive to 
with a power 0.95, given the sample sizes. In our power analyses, we assumed that 
the distribution of the fraction of larvae that rolled for the control populations, qo, 
was known and equal to the fraction measured in a large number of trials. 

For each experiment, we tested the null hypothesis that the fraction of experi- 
mental animals that rolled was less, more, or equal to the control fraction, 
depending on the type of experiment. For example, when activating the Goro 
neurons, we tested whether we could reject the null hypothesis that the fraction of 
larvae that rolled was less than or equal to the control fraction. In this case, 
assuming that each animal is an independent sample from the same distribution, 
the number of larvae rolling follows a binomial distribution. Thus, if the null 
hypothesis is that the experimental larvae roll at most as much as control, the 
probability that the null hypothesis holds can be computed from the binomial 
cumulative distribution function: 


Pr (n,gn) = 1—binocdf(qn,qo) 
<40 


Where n is the number of experimental larvae tested, q is the measured fraction 
that roll, and binocdf (k,r) is the binomial cumulative distribution function. We 
assumed we would reject the null hypothesis if this probability was less than 0.01. 
For a given go and n, we can thus compute oo.n? the minimum gq such that we 
would reject the null hypothesis using this equation. 

To compute the power of this statistical test given the sample size n and true 
fraction of experimental larvae expected to roll q, we assumed that our collected 
experimental data were independent draws from the Bernoulli distribution with 
parameter q, and computed the probability that our observed fraction of 


Pr(reject null hypothesis|qo <q, n) = 1—binocdf (Gan n, 40) 


Analogous equations were used for testing whether the experimental rolled at 
least as much as control or the same amount as control. 

Using these equations, we computed the power of each statistical test given the 

observed sample size n, control fraction that rolled qo, and the experimental 
fraction that rolled q, for each set of experiments. We also computed the min- 
imum effect size q / qo that would be sensitive with power 0.95, given the observed 
sample size n and control fraction that rolled qo. These results are listed in 
Supplementary Tables 1 and 3. 
Immunohistochemistry. The following antibodies were used: rabbit anti-GFP 
(1:1000, A11122, Invitrogen), rat anti-N-Cad (1:50, Developmental Studies 
Hybridoma Bank), mouse anti-nc82 (1:20; Developmental Studies Hybridoma 
Bank), mouse-anti-Myc (9E10, 1:20, Developmental Studies Hybridoma Bank), 
rabbit anti-HA-Tag (1:300, Cell Signaling, #3724), rat anti-FLAG (1:200, 
NOVUS, NPP1-06712), mouse anti-V5 Dylight549 (1:400, AbD Serotec, 
MCA1360D549). For secondary antibody Alexa Fluor 488 goat anti-rabbit IgG 
(1:200, Invitrogen), Alexa Fluor 568 goat anti-mouse IgG (1:200, Invitrogen) and 
Alexa Fluor 488 goat anti-mouse IgG (1:200, Invitrogen) were used. 

The central nervous systems of wandering third instar larvae were dissected in 
cold Schneiders Insect medium (Sigma, $0146) and fixed in 4% phosphate-buf- 
fered saline (PBS) with 4% paraformaldehyde (Electron Microscopy Science) at 
room temperature for 20 min. After washes in 0.4% Triton X-100 in PBS, samples 
were preblocked with 3% normal goat serum (NGS) in for 30min at room 
temperature. The samples were then incubated in primary antibody for 2 to 3 days 
at 4°C. After extensive washes in 0.4% Triton X-100 in PBS, the samples were 
incubated at 4°C for 2 days in a secondary antibody solution. After further 
extensive washes, the samples were rinsed in PBS, dehydrated through an 
EtOH series, cleared in Xylene and mounted in DPX mountant (Sigma- 
Aldrich). Images were taken with a 710 or 700 laser-scanning confocal micro- 
scope (Zeiss) under 20X 0.8 NA or 63X, 1.4 NA objective and processed in 
Image] software® (NIH). 
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Functional connectivity assay (GCaMP recording). For activation of presynap- 
tic neurons (MD IV, chordotonal, Basins or A05q) with the ATP-gated P2X, 
channel and imaging in postsynaptic neurons (Basins, Goro, A00c or A05q), the 
central nervous system of wandering third instar larvae were dissected out in cold 
physiological saline containing 103 mM NaCl, 5 mM KCl, 5 mM HEPES, 26 mM 
NaHCO;, 1 mM NaH, PO,,5 mM Trehalose, 6 mM sucrose, 2 mM CaCl, 2 mM 
HO, 8mM MgCl, 6 mM H,0, and kept stable by sticking them on poly-.-lysine 
(SIGMA, P1524) coated cover glass placed in small Sylgard (Dow Corning) plates. 
A borosilicate pipette (1 to 2 1m tip diameter) filled with 5mM adenosine 5’- 
triphosphate (ATP, Amersham 27-1006) and 50g ml! Alexa Fluor 594 hydra- 
zide (Invitrogen, A10438) dye in physiological saline was positioned 3 to 5 um 
from the axon terminals of the target presynaptic neurons expressing the ATP- 
gated P2X, channel and dsRed and the ATP solution was pressure-ejected using a 
Picospritzer (Parker Hannifin Co.). 

When ejecting ATP onto the axon terminals of MD IV or chordotonal neurons, 
we imaged calcium transients in the dendritic arbor of the Basin neurons, not the 
cell bodies (which in the insect are removed from the neuropil). When stimulat- 
ing Basins in abdominal segments, by ejecting ATP onto their axon terminals in 
ventral nerve cord, we imaged calcium transients in the dendrites of Goro (in 
dorsal thoracic nerve cord), or in axon terminals of A05q (in dorsal nerve cord) 
and A00c (in the brain). When stimulating A05q by ejecting ATP onto their 
dendrites in ventral nerve cord in abdomen, we imaged calcium transients in 
Goro dendrites in dorsal thoracic nerve cord. 

Puffs were 25 ms in duration, and the pressure was adjusted so that red fluor- 
escence (from the Alexa 594) ejection could be seen clearly while imaging. 
The same stimulus was presented every 30 sec for a total of three trials in each 
experiment. 

When vibration stimuli were also used in experiments, rather than only P2X, 
activation, the larvae were dissected using methods similar to those described 
previously for neuromuscular junction” on the Sylgard plate. In brief, larvae were 
cut up along the dorsal body wall midline and then opened by placing dissection 
pins. The cell bodies of abdominal peripheral sensory neurons and their nerves 
were kept intact. The anterior tips were removed, and the CNS was attached to a 
poly-L-lysine coated cover glass. For vibration stimuli in the two-photon rig, we 
embedded piezo extension actuators (Piezo System Inc., Model Q220-A4-303XE) 
in the Sylgard plate and used TTL pulses from the Prairie View software to control 
the timing of stimulation. The acceleration was measured with an ADXL345 
3-axis accelerometer’®. The vibration stimuli were 1,000Hz and 6.67ms ’, 
15 sec long. The same stimulus was presented every 30sec for a total of three 
trials in each experiment. For comparing the responses of Basin-4 to combined 
activation (nociceptor activation with P2X, and ATP injection plus vibration) to 
nociceptor activation or vibration alone (Figure 4d and e), each trial consisted of 
ATP injection, vibration, and a combination (ATP injection and vibration), with 
30 sec between each stimulation. Three such triplet trials were presented in each 
experiment. 

We performed imaging on a Prairie Technologies two-photon microscope 
using an Olympus 60X 0.8 NA objective. A mode-locked Ti:Sapphire 
Chameleon Ultra II laser (Coherent) tuned to 925 nm was used for excitation. 
Fluorescence was collected with photomultiplier tubes (Hamamatsu) after band- 
pass filtering. Images were acquired in frame-scan mode (maximum speed, 
1,215 f.p.s.) for a single plane of the CNS. 

GCaMP image analysis. Image data were processed by Image] software (NIH) 
and analysed using custom code written in MATLAB (MathWorks). Specifically, 
regions of interest (ROIs) were determined by averaging the ten frames before 
stimulation and segmenting this data by the function MEAN® in ImageJ. The 
mean intensity of the ROI was measured in ImageJ. In all cases, changes in 
fluorescence were calculated relative to baseline fluorescence levels (Fy) as deter- 
mined by averaging over a period of at least 3 sec just before ATP puff or vibration 
stimulation. AF/Fo values were calculated as AF/Fy = (F; — Fo)/Fo, where F, is the 
fluorescent mean value of a ROI in a given flame. Peak AF/Fp is the maximum 
value in the 5 sec window following stimulus onset. 

Electron microscopy preparation and imaging. For generating the electron 
microscopy dataset comprising 1.5 abdominal segments (segment a2 and a3), 
the central nervous systems from 12-24-h-old wild-type w’””* first instar larvae 
were manually dissected out in PBS, and immediately transferred to 2% glutar- 
aldehyde in 0.1 M sodium cacodylate, pH 7.4 buffer. Samples were post-fixed in 
1% OsO, in the same buffer and stained en bloc with 1% aqueous uranyl acetate 
before subsequent dehydration in ethanol and propylene oxide, and embedding 
in Epon. Serial 45 nm sections were cut with a Leica UC6 ultramicrotome using a 
Diatome diamond knife, and picked up with Synaptek slot grids with Pioloform 
support films. Sections were stained with uranyl acetate followed by Sato’s lead®. 

For generating the electron microscopy dataset comprising the entire central 
nervous system, central nervous systems from 6-h-old [iso] Canton S G1 x w'18 
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[iso] 5905 female larvae were manually dissected out in PBS. The isolated central 
nervous systems were immediately transferred to 125 pl of 2% glutaraldehyde in 
0.1M sodium cacodylate buffer, pH7.4 in a 0.5 dram glass vial (Electron 
Microscopy Sciences, cat. no. 72630-05) to which an equal volume of 2% OsO, 
in 0.1 M sodium cacodylate buffer, pH 7.4 was added and gently mixed. Each CNS 
was then fixed in an ice bath in a Pelco BioWave PRO microwave oven (Ted Pella, 
Inc.) at 350-W, 375-W and 400-W pulses for 30sec each, separated by 60-sec 
intervals with the microwaves off. Samples were rinsed 3 X 30 sec at 350 W with 
0.1M sodium cacodylate buffer, separated by 60-sec intervals with the micro- 
waves off, and post-fixed with 1% OsO, in 0.1 M sodium cacodylate buffer at 350- 
W, 375-W and 400-W pulses for 30 sec each, separated by 60-sec intervals with 
the microwaves off. After rinsing with distilled water 3 X 30 sec at 350 W with 60- 
sec pauses between pulses, the samples were stained en bloc with 1% uranyl 
acetate in water by microwave at 350 W for 3 X 30 sec with 60-sec pauses between 
microwave pulses. The samples were then dehydrated in an ethanol series fol- 
lowed by propylene oxide, infiltrated and embedded with EPON resin. Serial 50- 
nm sections were cut with a Leica UC6 ultramicrotome using a Diatome diamond 
knife, and picked up with Synaptek slot grids with Pioloform support films with 
2nm of carbon (C). Sections were stained with uranyl acetate followed by 
Sato’s lead”. 

Both data sets were imaged using Leginon”, the 1.5 abdominal segment on an 

FEI T20 TEM (Hillsboro) at 4.4nm X 4.4nm pixel resolution, the complete L1 
brain on an FEI Spirit TEM at 3.8nm X 3.8 nm pixel resolution. The resulting 
images were montaged and registered using the nonlinear elastic method 
described in ref. 23. 
Neuronal reconstruction. Reconstructions were made in a modified version of 
CATMAID” (http://www.catmaid.org). The process of reconstructing neurons 
in CATMAID involves starting with a specified neurite in a section of the electron 
microscopy data, and manually building a 3D skeleton representation of the 
neuronal morphology and the location of synaptic active zones and synaptic 
partners. 

The synaptic connections mapped and reported in this study all represent fast, 
chemical synapses. Synapses must match several criteria over multiple adjacent 
sections: a thick black active zone, vesicles, and presynaptic specializations, such 
as a T-bar, on the presynaptic side, and evidence of postsynaptic membrane 
specializations”. 

To identify sensory axons, reconstruction was initiated at the entry point of 
nerves into the neuropil. Sensory neurons project ventrally in the nerve cord, and 
motor neurons project dorsally*”**”, so the sensory axons were readily identified 
in the ventral portion of the nerve root. All sensory axons were traced and MD IV 
and chordotonal axons were recognized based on their distinguishing features, 
previously described using confocal microscopy imaging*”**’*. Chordotonal pro- 
jections were readily identified based on the characteristic intermediate location 
of their terminals in the mediolateral axis and the dorsoventral axis of the nervous 
system. We found that all eight chordotonal axons were already tightly bundled 
together in the nerve root. The MD IV projections were readily recognized based 
on the characteristic medial-most and ventral-most location of their terminals. 
Neuronal partners of the eight chordotonal axons and the three MD IV axons in 
both hemisegments were reconstructed by initiating reconstructions at the site of 
every postsynaptic target in all chordotonal terminals until all targets were 
exhausted. A review of all skeletons was done by CMSM to identify incorrect 
or missed branches and synapses. 

In the comprehensive L1 electron microscopy volume, Basin cells in Al and A4 
segments were found using landmarks such as cell body position, the entry point 
of primary neurite into the neuropil, and branch points on the primary neurite. In 
brief, the Basin lineage was first recognized on the basis of the characteristic cell 
body location (lateral intermediate), the trajectory of their primary neurite which 
is parallel to the entering nerve, and the neuropil entry point of the lineage, as well 
as the pattern of the low-order, large branches. The lineage contains more cells 
than just the four Basins, but the other cells have very different projection pat- 
terns. The Basins in the Al segment were identified by tracing the primary and 
secondary neurites of multiple members of the lineage and abandoning those that 
did not have the distinguishing features of the Basins. The four Basins in each 
hemisegment were reconstructed in full and individual Basins recognized based 
on their unique features (see Extended Data Fig. 2). Basin targets were recon- 
structed by tracing exhaustively downstream of all presynaptic sites in Basin 
axons. Goro neurons were found in a similar way as Basins. Starting with high- 
resolution 3D image stacks of single cells (visualized using the FLP-based strategy 
described above), we looked at the cell body location, primary neurite projection, 
and major branch points. We then traced neurites consistent with the primary 
neurite projection and cell body location until finding only two individual cells, 
one in each hemisegment of segment T2, with branch points consistent with Goro 
neurons. Completed reconstructions were then checked for consistency with light 


data in axon and dendritic projections. Partial reconstructions of sibling neurons, 
as well as homologous neurons in adjacent thoracic segments, resulted in 
morphologies very different from that of Goro neurons as observed by light 
microscopy, indicating that only two neurons in the whole of the thorax match 
the expected morphological pattern. Descending neurons were found by recon- 
struction of neurons upstream of synaptic inputs in Goro dendrites. A subset of 
fourth-order neurons was found by reconstructing downstream of all synaptic 
output sites of only one of the ascending third-order A00c-a4 neurons. Left and 
right homologues of reconstructed neurons were identified by considering a suite 
of morphological and synaptic features, focusing on neuropil entry location, 
primary branch locations, and the location of pre- and postsynaptic sites. 

Neuronal morphologies were analysed in MATLAB using custom scripts. 
Circuit diagrams were made in Gephi” and edited in Adobe Illustrator CS6 
(Adobe Systems, Inc.). 

Validation of methodology of electron microscopy reconstruction. In 
Drosophila, as in other insects, the gross morphology of many neurons is stereo- 
typed and individual neurons are uniquely identifiable based on morpho- 
logy’”*°’*’’, Furthermore, the nervous system in insects is largely bilaterally 
symmetric and homologous neurons are reproducibly found on the left and 
the right side of the animal. In this study we therefore validate the wiring diagram 
by: (1) independently reconstructing partners of homologous neurons on the left 
and right side of the nervous system; (2) independently reconstructing partners of 
homologous neurons in EM volumes from two different individuals; (3) perform- 
ing functional connectivity experiments to validate some key connections pre- 
dicted by the wiring diagram. 

Validating the wiring diagram by independent reconstruction on the left and 
right. By randomly re-reviewing annotated synapses and terminal arbors in our 
dataset we estimated the false positive rate of synaptic contact detection to be 
0.0167 (1 error per 60 synaptic contacts). Assuming the false positives are uncor- 
related, for an n-synapse connection the probability that all n are wrong (and thus 
that the entire connection is a false positive) occurs at a rate of 0.0167”. Thus, the 
probability that a connection is false positive reduces dramatically with the num- 
ber of synaptic connections contributing to that connection. 

Even for n = 2 synapse connections, the probability that the connection is not 
true is 0.00028 (once in every 3,586 two-synapse connections) and we call con- 
nections with two or more connections ‘reliable’ connections. Similarly, the 
probability that even a one-synapse connection annotated between homologous 
neurons both on the left and on the right side of the nervous system is not true is 
0.00028. The probability that a false-positive connection with two synapses is 
observed twice between the same cell types is even lower (0.000282). Because 
false-positive synapses between a single pair of homologous neurons could poten- 
tially be correlated (due to mis-annotation of a single branch with multiple con- 
tacts), we perform extra rounds of focused re-review of the arbors associated with 
connections in our wiring diagram with less than four synapses to confirm that 
they are neither mis-annotated synapses or incorrect branches. Furthermore, 
while false positives between a single pair of homologous neurons could poten- 
tially be correlated, the false positives between different pairs of homologous 
neurons (on the left side and the right side) cannot be correlated (they are not 
on the same branch). Thus the probability that reliable (with two or more 
synapses) and reproducible (found between both the left and right homologues 
within an individual) connections are not true is extremely low and we are highly 
confident in such connections. All the connections discussed in this study and all 
the principles derived from the wiring diagram are based on such reliable and 
reproducible connections. 

False negatives are far more common in our reconstructions than false posi- 

tives. We were able to trace 81.5% (1,676/2,054) of postsynaptic sites downstream 
of Basins back to identifiable neuronal arbors, leaving 18.5% (378/2,054) of 
synaptic outputs associated with small, unidentifiable arbor fragments due to 
ambiguities in the data. The majority have only a single synaptic input from 
Basins (163/231 fragments), and only 8/231 fragments have five or more Basin 
inputs. If the synapses that could not be associated with identified neurons were 
distributed randomly, the likelihood of completely omitting a connection with 
n synapses is approximately 0.185", which is 0.006 for n = 3. Thus the like- 
lihood that we have completely omitted a three-or-more-synapse connection is 
quite low. 
Reproducibility of the wiring diagram across different individuals. For a con- 
nection in the wiring diagram to be useful for future physiological and beha- 
vioural experiments in different individuals, it should not just be correctly 
identified in one animal, but it should be generalizable across animals. We there- 
fore assessed the extent to which the connections in this wiring diagram are 
reproducible across individuals. 

For this study we obtained electron microscopy volumes from two different 
larvae: one comprising 1.5 nerve cord segments (a2/a3), and one comprising the 
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entire nervous system. We reconstructed the chordotonal and MD IV sensory 
neurons, the Basin neurons and all the other partners downstream of both the 
nociceptive and the sensory neurons in both electron microscopy volumes. In the 
smaller abd1.5 volume some of the downstream neurons had a large fraction of 
their arbor outside the volume and were not identifiable, whereas others were 
mostly confined within the electron microscopy volume and were uniquely iden- 
tifiable based on key morphological features. We could therefore assess the repro- 
ducibility of connections between homologous neurons present and uniquely 
identifiable in both volumes across two individuals (Extended Data Fig. 3). We 
identified connections between neuron types that have at least two synapses and 
for which a connection with at least two synapses also exists between homologous 
neuron types on the other side of the same individual. 100% (n = 36) of connec- 
tions with at least two synapses per connection that were reproducible across the 
same individual were also reproducible across individuals; that is, those connec- 
tions also existed on both sides in the other individual. 

If we consider each individual sensory neuron in a hemisegment as unique, we 
find a total of 76 connections (38 pairs of connections) between individual neurons 
with at least two synapses between homologues on both the left and right of the 
same individual. In 100% of cases in which a connection was present between 
homologues on both sides in one animal it was also present between homologues 
in the other animal, at least on one side. 96% of connections with at least two 
synapses that existed between homologous neurons on both sides in one animal 
also existed between homologues on both sides in the other animal. 4% of connec- 
tions with at least two synapses present between homologues on both sides in one 
animal were not present between homologues in the other animal on the same side, 
but a connection was present between homologues on the other side (in the other 
hemisegment) in the other individual (see Extended Data Fig. 3 for more details). 

Together with the above error rates, the observed reproducibility of connec- 
tions across individuals gives us strong confidence that the observation of a 
reliable connection in one animal both measures a real connection in that indi- 
vidual and predicts similar connectivity in other animals. 

Functional validation of connections mapped using electron microscopy 
reconstruction. Another way to validate a wiring diagram is to functionally test 
the connections obtained with electron microscopy reconstruction. If the ana- 
tomical and functional connectivity studies are done in different individuals this 
provides further validation of the reproducibility and generalizability of the wir- 
ing diagram. In this study we were able to generate reagents to functionally test a 
number of anatomical connections predicted by the wiring diagram. By activating 
the presynaptic partner and monitoring calcium transients in the direct post- 
synaptic partner we tested the following reproducible direct anatomical connec- 
tions with, on average, five or more synapses per connection: 

¢ MD IV onto Basin-4 (average >10 synapses per connection) (Fig. 3b) 

* Chordotonal onto Basins (average >10 synapses per connection) (Extended 
Data Fig. 4d) 

* Basins onto A00c (average >10 synapses per connection) (Extended Data 
Fig. 6)) 

* Basins onto A05q (average >5 synapses per connection) (Extended Data 
Fig. 6e) 

« A05q onto Goro (average 5 synapses per connection) (Extended Data Fig. 6g) 

For the anatomical connection from MD IV onto Basin-1 with only two 

synapses per connection on average, the functional connectivity experiments 
did not reveal a functional excitatory connection (Fig. 3a). 
Model of multilevel convergence. The model consists of two sensory inputs 
processed in two layers (Extended Data Fig. 8a). The sensory inputs, chordotonal 
and MD IV, have activity Emech and Enoci respectively. Sensory input intensity can 
range from 0-100 (arbitrary units), with the maximum value intended to approx- 
imate the highest intensity stimulus provided in the behavioural experiments of 
Fig. 1. The first processing layer has two nodes, each representing one or more 
neurons: (1) a ‘pure’ node with output gp(Emech) that receives only mechanosen- 
sory input; and (2) a ‘mixed’ node with output gj(EmechsENoci) that receives input 
from both sensory modalities. The second processing layer is a single node with 
output gp (gp, guy) that receives input from both nodes of the first processing layer. 
The output of the second layer constitutes the output of the circuit. 

We considered the steady-state output of the network, assuming that the 
response of each node is a sigmoidal function of its total input, with a lower 
threshold below which output is near-zero and an upper saturation point. We 
modelled this as a logistic function”: 


f (E; 0,8) = Cop —e=y 


lte FP 
with total input E, threshold 0, transition width f, and a constant Cy, z defined so 
that f(100)=100. We set 6=10 for all nodes, which makes the transition between 
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low and high output cover about half of the input range. We omit / in subsequent 
equations for brevity. 
The output of the pure node in the first processing layer is given by: 
&P (Etech) =f (Emech; Op) 

For the mixed node, we take the total input to be a weighted sum of the sensory 
inputs: gv (EmechsENoci) =f (Wa EMech + (1 — Wo) Enoci} Ot) 
where 0 <wy <1 measures the relative weighting of mechanosensory input. To 
require that the maximum total input each node can receive is the same, the 
weights sum to one. 
For the second layer, we take total input to be a weighted sum of the outputs from 
the nodes in the first layer: 


gB(Emech,ENoci) =f (Wegp(Emech) + (1 — wp) gm (EmechsENoci); Os) 


where 0 < wz <1 measures the weighting of the pure chordotonal node in the first 
layer. 

A common feature of multisensory circuits is their enhanced sensitivity to 
combinations of weak stimuli presented together. To consider this property in 
the context of our circuit, we looked for circuit responses that are maximally 
sensitive to multisensory stimuli, but that are also consistent with basic experi- 
mental observations. We treated this as a constrained optimization problem; we 
found mixing parameters wy and wg, that optimize a measure of multisensory 
sensitivity, subject to the constraint that the responses to unimodal stimuli are 
similar to experimental observations (Extended Data Fig. 8b-d). We looked at the 
resulting solutions for a range of threshold parameters. 

For the sensitivity measure, we assumed that the optimal solutions are respons- 
ive to combinations of both inputs (Extended Data Fig. 8b), as measured by the 
integrated response along a Gaussian-weighted band centred along the 
EMech= Enoci diagonal line: 


100 100 
_ a 2 
| &B (EmechsENoci)€ (Extech — Enoci)/44 dEMech4ENoci 


0 0 


S(wm.Ws) = 


where d measures the width of the band, which we set to 20 (Extended Data 
Fig. 8c). Although this condition alone would encourage strong responses to 
extremely low values of input stimulus, an undesirable situation, the combination 
of the constraint and the monotonicity of the response functions ensure that this 
cannot occur. 

For the constraint, we demanded that intense unimodal input match observa- 
tions (see Fig. 1c): (1) that intense, pure chordotonal input induced no rolling; and 
(2) intense, pure MD IV input induced a modest, but non-zero, fraction of 
animals to roll (Extended Data Fig. 8d). We quantified this as: 


h(wy.ws) = 


V@ol0, 100; wM,WB) — noci)” + (gp (100,0; wa.) —aMech)” 


where we set Gnoci = 30 and amech = 0 (Extended Data Fig. 8e). 

We then calculated optimal connection weights given neuron thresholds. We 
assumed that the pure mechanosensory interneuron does not have an extremely 
high threshold, set Op to a value between 30 and 50 and explored the constrained 
optimization problem for 0,4, and 0g values between 10 and 80. For each set of 
parameters, the multilevel convergence solution was defined as the circuit output 
corresponding to the parameters (wy, wg) that maximize S(w, wp) subject to 
h(wWyp Wp) < ho. We similarly defined the early convergence solution as the circuit 
with the wy, that maximizes S(wy, 0) subject to h(wy, 0) < ho. We set hy = 3 to 
ensure strong compliance with our unimodal constraints and numerically com- 
puted and compared solutions across a grid of wg and wg values starting at 0 with 
0.01 spacing (Extended Data Fig. 8f). To check the robustness of early and multi- 
level convergence solutions to threshold parameters, we explored multiple values 
of Og and 0, (Extended Data Fig. 8g). 
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Extended Data Figure 1 | This figure relates to Fig. la-d. Chordotonal (Ch) 
neurons respond to vibration in a ‘dose-dependent’ manner, but MD IV 
neurons do not, consistent with previous reports. a, 1,000 Hz vibration evokes 
calcium transients (mean + s.e.m. in this and all subsequent figures) in 
chordotonal (ch-GAL4>UAS-GCaMP6s), in a dose-dependent manner. 

b, c, Calcium transients of MD IV neurons in response to different vibration 
(1,000 Hz) intensities reveal no response to vibration (MD IV-GAL4>UAS- 
GCaMP6s). d, Quantification of calcium transients in mechanosensory (black) 
and nociceptive (orange) neurons from (a-c). n = 3 trials in each condition. 
e, A schematic of larval reactions to unimodal and multimodal stimuli. Line 
thickness qualitatively reflects the probability of the behaviour transition 
occurring in the absence of stimulation (bottom left) and after the onset of MD 
IV activation (orange triangle, bottom right), vibration (green triangle, top left), 
or their combination (black triangle, top right). Behaviours in grey occur with 
extremely low frequency. In response to nociceptive or combined stimulation 
animals can select one of two types of escapes responses: (1) roll-fast crawl; or 
(2) fast crawl alone. Even in response to combined stimulation the selection is 
probabilistic; the majority of animals select roll-fast crawl, but some select fast 
crawl without rolling. f, Crawling stride speed in the absence of stimulation 
(white) and following 1,000 Hz, 6.7m s * vibration (green) (in MD IV- 
GAL4>+ larvae); thermogenetic nociceptive activation (32 °C, in MD IV- 
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GAL4>UAS-dTRPAI] larvae) (orange); or their combination (grey). In the 
nociceptive and combined condition, we separately analysed the stride speed of 
animals that did not roll (non-rollers) and those that rolled (rollers) following 
stimulation. As previously described, vibration alone evoked an increase in 
stride speed. Nociceptive activation alone, or in combination with vibration, 
also evoked an increase in stride speed, both in animals that did not roll and in 
animals that did roll, after rolling. Combining vibration with nociceptive 
stimulation in animals that selected fast crawl significantly increased stride 
speed compared to either stimulation alone. Vibration combined with 
nociceptive stimulation therefore affects both aspects of the escapes response, 
facilitating the triggering of rolling (Fig. le, f) and increasing crawling stride 
speed in animals that do not roll. Wilcoxon rank sum test with Holm- 
Bonferroni correction. **P < 0.0001. g, The synergistic effect on rolling of 
combining vibration with nociceptive stimulation is observed across a range of 
frequencies. Rolling probabilities of MD IV-GAL4>UAS-dTrpA1 (nociceptive 
and combined conditions) or MD IV-GAL4>+ (no stimuli and 
mechanosensory conditions) animals during vibration stimulation at different 
frequencies, either alone (green) or in combination with thermogenetic 
activation of nociceptive neurons (32 °C) (grey). See Supplementary Table 3 for 
n (animals) in this and all other Extended Data Figures. See Supplementary 
Table 2 for a list of genotypes used all Figures. 
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Extended Data Figure 2 | This figure relates to Fig. 2a, b. a, b, Confocal 
images of basins (green) and nociceptors (a) or chordotonals (b) (both in 
magenta), with cartoons to demonstrate their position within the nerve cord. 
c, Confocal images of individual Basins (in addition to those shown in 

Fig. 3a) visualized with a FLP-based method. Outline, neuropil boundary. 
Scale bar, 30 um. d, Table of morphological features based on which Basins 
are uniquely distinguishable from other interneurons (blue) and individual 
Basin types from each other (purple). e, The table of morphological features 
based on which the mechanosensory chordotonal (Ch; green) and the 
nociceptive (orange) neuron projections are uniquely identifiable. f, Electron 
microscopy reconstructions of MD IV (orange) and chordotonal (green) 
axons in two abdominal segments, from two different animals. Top, 
reconstructions from the A3 segment in the smaller electron microscopy 
volume (comprising 1.5 abdominal segments: A3 and a part of A2) shown in 
Fig. 2. Bottom, reconstructions from the segment A1 in the comprehensive 
electron microscopy volume that spans the entire nervous system (L1 
electron microscopy volume, see Fig. 5 and Supplementary Video 7). 
Outlines indicate approximate neuropil boundaries. Scale bar, 20 um. 

g, Electron microscopy reconstructions of nociceptive (orange) and 
mechanosensory (green) axons and of the four Basin types (black) in the two 
abdominal segments, from two different animals. Top, A3 segment from the 
smaller 1.5 abdominal segment volume. Bottom, Al segment in the 
comprehensive L1 electron microscopy volume. Outlines indicate 
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approximate neuropil boundaries. Scale bar, 20 1m. The MD IV and 
chordotonal terminals span the ventromedial and ventrocentral nerve cord 
domains, respectively. Basin-2 and Basin-4 dendrites overlap with 
chordotonal and MD IV terminals. h, Detailed views of the neuronal arbors 
and synaptic locations of each Basin cell type from the two segments (from 
the two animals). Cells on the right side are mirrored for easier comparison. 
The location of synaptic domains is conserved across hemisegmental 
homologues and animals. The dendritic arbors of individual Basins across all 
four hemisegments are highly stereotyped and uniquely recognizable. 

i, Tables show the numbers of synapses that Basins in the A3 segment (from 
the smaller 1.5 segment electron microscopy dataset, top) and in the Al 
segment (from the comprehensive L1 electron microscopy volume, bottom) 
receive from MD IV and chordotonal neurons in their own and in adjacent 
abdominal segments. See also Supplementary Tables 4a and 5a for more 
information. Both the left and right Basin-1 and Basin-3 homologues 
reproducibly received many synapses (each >25 synapses and >15% of total 
input, on average) from chordotonal in both animals, but very few (no more 
than 1% of total input synapses) from MD IV neurons. Both the left and right 
Basin-2 and Basin-4 homologues received many inputs from both 
chordotonal (on average >20 synapses and >10% total input) and MD IV 
(on average >20 synapses and >10% total input) neurons. j, Schematic 
representation of the distribution of inputs from MD IV and chordotonal 
from adjacent segments onto Basins. 
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Extended Data Figure 3 | Reproducibility of the wiring diagram across 
animals. a, Electron microscopy reconstructions of the local interneurons 
downstream of both the MD IV and Basin neurons in both electron microscopy 
volumes; the volume comprising 1.5 abdominal (A2/A3) nerve cord segments 
of one animal (abd1.5, Fig. 2e), and the electron microscopy volume comprising 
the entire nervous system of another (L1, Fig. 5a and Supplementary Video 7). 
These local neurons (as well as the Basins shown in Fig. 2f and Extended Data 
Fig. 2h) with cell bodies and the majority of their arbor in the same segment as 
the sensory neurons are readily identifiable in both volumes and can be used to 
assess the reproducibility of sensory-interneuron and interneuron- 
interneuron connectivity across animals. Three distinct short-range 
interneuron types (A02m, A02n and A10a) received at least two synapses from 
nociceptive sensory neurons in both animals. A02m/n and A10a neurons also 
received inputs from Basin-2 or Basin-4 (Supplementary Information 
(Supplementary Atlas section)). The two interneurons A02m and A02n belong 
to the same lineage (lineage 2), have indistinguishable projections and 
connectivity, and can only be distinguished based on a small difference in the 
dorsoventral cell body location. In contrast, Basin-1, Basin-2, Basin-3 and 
Basin-4, even though they belong to the same lineage (lineage 9), all have 
distinct projections (and connectivity) and can each be considered a unique 
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neuron type. Al0a also have unique projections and belong to lineage 10. 

b, Total number of inputs each neuron in a receives. c, Reproducibility of 
connections between neuron types across individuals. Table shows class level 
connectivity between neurons in the abd1.5 and L1 data sets. Values given are 
the total sum of all synapses for all members of each class. 100% of connections 
with at least two synapses between homologous neuron types that were 
reproducible across the same individual were also reproducible across 
individuals; that is, the connection between those neurons existed in the other 
individual. d, Reproducibility of connections between individual neurons 
across individuals. Table shows connectivity between individual neurons in the 
abd1.5 and LI data sets. Whenever two or more synapses were found between 
homologous neurons in both hemisegments in one animal, either two or more 
were also found in both hemisegments in the other animal (dark green cells); or 
at least one synapse per edge was found in both hemisegments in the other 
animal (light green cells); or two or more synapses were found in at least one 
hemisegment of the other animal (yellow cells). Thus, in 100% of cases in which 
a connection was present between homologues on both sides in one individual 
it was also present between homologues in the other individual, at least on 
one side. 
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Extended Data Figure 4 | This figure relates to Figs 2 and 3. a, c, Electron 
micrographs of representative synapses formed by MD IV and chordotonal 
(Ch) projections onto Basins containing a T-bar ribbon. Circles point to the 
postsynaptic neuron and arrows point to vesicle clouds in the presynaptic 
terminals. All synapses have small clear-core vesicles associated with fast 
chemical neurotransmitters such as acetylcholine, glutamate, or GABA. Scale 
bars are 200 nm. b, d, Calcium transients evoked in Basins by MD IV (b) and 
chordotonal (d) activation by local ATP injection. The ATP-gated channel, 
P2Xp, is expressed in MD IV (b) and chordotonal (d) neurons and GCaMP6 is 
expressed in Basins (in MD IV-lexA>lexAop-P2X>; R72F11-GAL4>UAS- 
GCaMP6 (b) and Ch-lexA>lexAop-P2X>; R72F11-GAL4>UAS-GCaMP6 

(d) animals). n = 30 trials (b), and 18 trials (d). Functional connectivity 
experiments suggest the connection from sensory neurons onto Basins is 
excitatory. Since chordotonal neurons are the only neurons that respond to 
vibration**”’ we used vibration in subsequent experiments so that we could 
activate chordotonal with vibration, and MD IV with P2X and ATP injection, 
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in the same animal. e, R57F07 drives expression in Basin-4 neurons and ina few 
brain neurons. R57F07 was therefore used for generating the Basin-4 selective 
split-GAL4 line shown in g. g, h, Expression patterns of driver lines selective for 
Basin-1 (R20B01) and Basin-4 (R72F11 and R57FO07 intersection) neurons 
(green). Scale bar, 25 jum. f, Activation of R57F07 is sufficient to evoke rolling in 
45% of animals. Intersecting the parent line R57F07 with R72F11 (both drive 
expression in Basin-4 neurons and evoke rolling) results in a split-GAL4 line 
that drives expression selectively in Basin-4 neurons, shown in g. i, j, Negative 
control for Fig. 4d, f. i, Calcium transients in MD IV terminals observed in 
response to vibration (1,000 Hz, 6.8m s’), MD IV activation by local ATP 
injection or their combination (in MD IV-lexA>lexAop-P2X2; MD IV- 
GAL4>UAS-GCaMP¢6 animals). n = 14 trials. j, Maximum AF/F of calcium 
transients in response to mechanosensory, nociceptive, and combined 
simulation from i. Unlike Basin-4 (Fig. 3b, c), the MD IV neurons do not 
respond more to the combination of stimuli compared to MD IV activation 
alone. 
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Extended Data Figure 5 | This figure relates to Fig. 4.a—c, The RI6E11-LexA also drives expression in more neurons. Using a FLP-based intersection 


line faintly, but selectively, drives expression in the same pair of thoracic 
neurons (Goro neurons) as the R69F06 GAL4 line. a, Co-localization of 
R16E11-LexA and R69F06-GAL4 expression in the same nervous system 
(genotype: R16E11-lexA/LexAop-mry-tdTomato;R69F06-GAL4, UAS-mry- 
GFP). RI6E11-lexA>lexAop-tdTomato (red) and R69F06-GAL4>UAS-GFP 
(green) co-localize in the two Goro cell bodies and their projections in the nerve 
cord (yellow). b, c, Single-channel images from a. Scale bar, 20 um. 

d, Thermogenetic activation (32 °C) of the single pair of Goro neurons using 
the RI6E11 line (R16E11-lexA>lexop-dTRPA1) evoked rolling in 36% of 
animals. Activating neurons in R69F06 evokes even more rolling (86%, Fig. 4a). 
R69F06 drives expression in Goro neurons more strongly that RI6E11, but it 


between R16E11-lexA and R69F06 (see Methods), we found that 76% of animals 
with expression of dTRPA1 selectively in Goro rolled when exposed to 
restrictive temperature (32 °C). Thus, activating the single pair of command- 
like Goro neurons evokes rolling. e, Calcium transients in Goro dendrites in 
response to selective Basin-4 activation by local ATP injection (m = 12 trials). 
The ATP-gated P2X, channel and GCaMP6 are expressed selectively in Basin-4 
and Goro neurons, respectively (in Basin-4-Gal4>UAS-P2X>; Goro- 
lexA(R69F06-lexA)>lexAop-GCaMP6 animals). f, Basin (magenta) and Goro 
(green) projections do not co-localize (in 16E11-lexA>LexAop-tdTomato; 
R72F11-GAL4>UAS-GFP larvae), indicating that the two cell types that are 
functionally connected cannot be directly synaptically connected. 


©2015 Macmillan Publishers Limited. All rights reserved 


a b 
Basin activation 
a _______. 7 
80 5 
= brain lobe intact x 7 
ev brain lobe cut a | 
= brain lobe cu uc 
gz | ec 
us S| 
iz — | 
‘| 2 
8 8 | i 
oo 5s 1 
0 4 
0+ Brain Brain 
intact cut 
d Basin-2 
ro 
af 
oie 
| 
i> = 
7 g 
Basin activation A05q activation 
a ae — 
50 50 
=> oS il 
uw ue 
+ ~ 5 
3 g 2s 
it 2 
oO jo} 
<= 5s (0) 
0 Ps on 0 


Extended Data Figure 6 | This figure relates to Fig. 5. a, The nerve cord 
Basin-Goro pathway is sufficient to activate the Goro neurons. Calcium 
transients in Goro dendrites evoked by Basin activation by local ATP injection 
with the brain lobes intact (m = 15 trials) or removed (n = 24 trials) (in R72F11- 
LexA>LexAop-P2X3; Goro-GAL4(R69F06)>GCaMP6 animals). 

b, Quantification of a. The difference is not significant. Error bars show s.e.m. 
c, Left, R47D07 drives expression in A05q neurons (that receive direct inputs 
from Basin-2 and synapse onto Goro neurons) and in other neurons in thorax 
and brain (green, visualized with anti-GFP). In abdominal segments A8 and A7, 
the only neurons labelled with this line are the A05q neurons (dashed circle). 
The rest of the nervous system (grey) is visualized with anti-N-cadherin. White 
square, A05q axon terminals that synapse onto Goro dendrites. Scale bar, 

25 um. Note that this line also drives expression in developing adult neurons in 
thorax and brain that are not functional in the larval nervous system (secondary 
lineages). Right, image of individual A05q cells (magenta) visualized with a 
FLP-based strategy in R47D07, in top-down (top) and cross-section (bottom) 
views. Arrowheads indicate midline. Thin white lines indicate neuropil 
boundary. d, f, i, Electron micrographs of representative synapses formed by 
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Basin-2 onto A05q (d), A05q onto Goro (f) and Basins onto A00c (i) containing 
T-bar ribbons. Circles point to the postsynaptic neuron and arrows point to 
vesicle clouds in the presynaptic terminals. All synapses have small clear-core 
vesicles associated with fast chemical neurotransmitters such as acetylcholine, 
glutamate, or GABA. Scale bars, 200 nm. e, Calcium transients in A05q axon 
terminals (square in c) evoked by Basin activation using local ATP injection 
(R72F11-LexA>lexop-P2X>; R47D07-GAL4>UAS-GCaMP6, n = 18 trials) 
suggest a functionally excitatory connection. g, Calcium transients in Goro 
neurons evoked by A05q activation using local ATP injection specifically in the 
vicinity of A05q dendrites in A8 (dotted circle in c; R47D07-GAL4>UAS-P2X,; 
Goro(R69E06-LexA>lexop-GCaMP6, n = 18 trials) suggest a functionally 
excitatory connection. h, Thermogenetic activation of neurons in R47D07 
triggers rolling (exposing R47D07-GAL4>UAS-dTrpA] animals to 32 °C). 
Control, +>UAS-dTrpA1. j, Calcium transients in A00c evoked by Basin 
activation using local ATP injection (R72F11-LexA>lexop-P2X,; R71A10- 
GAL4>UAS-GCaMP6, n = 15 trials) suggest a functionally excitatory 
connection. 
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Extended Data Figure 7 | Distinct neurons differentially control distinct 
aspects of the escape sequence. a, Graphs show the time course of rolling 
(green) and crawling (red and blue) behaviours before, during and after a 15 sec 
(indicated by the grey box) optogenetic (617 nm red light) activation period. 
The percentage of larvae that are rolling (green line) or crawling (red line) at 
each time point is shown. The normalized speed of crawling strides (normalized 
by speed prior to optogenetic stimulation) is also shown for those animals that 
are crawling at the time (blue line). Means (dark line) and s.e.m. (shaded areas) 
are shown. Data are from +>UAS-CsChrimson, basin1-GAL4>UAS- 
CsChrimson, basin4d-GAL4>UAS-CsChrimson, Goro-GAL4(R69F06)>UAS- 
CsChrimson, or A00c-GAL4(R71A10)> UAS-CsChrimson animals, from top to 
bottom. b, Percentage of larvae that rolled at least once during the entire 15 sec 
activation window. Error bars are 95% confidence intervals. **P < 0.0001. 
Optogenetic activation of Basin-4 and Goro neurons evokes rolling, consistent 
with the results of thermogentic activation experiments shown in Figs 2a, g and 
4e. Note that the instantaneous percentage of rolling shown in a is always lower 
than the percentage shown in b because not all larvae roll at exactly the same 
time. c, Comparison of the normalized speed of crawling strides in response to 


optogenetic activation. Boxes show medians (red) with 25th and 75th quantiles 
of the data. Whisker is 1.5 interquartile range (IQR) for the length of the 
whiskers. Red plus is outlier. **P < 0.0001. We observe that in control larvae, 
red light alone evoked a mild reduction in crawling speed, whereas Basin-1 
activation evoked a mild, but significant, increase in stride speed and did not 
evoke rolling. Basin-4 activation evoked the entire escape sequence (roll 
followed by fast crawl). Note first an increase in instantaneous percentage of 
rolling larvae (peak in green line in a), followed by an increase in the 
instantaneous percentage of crawling larvae, and an increase in stride speed 
relative to the period prior to stimulation (peak in blue line in a). Goro neuron 
activation only evoked rolling and not crawling (note the percentage of 
crawling larvae is almost 0 during the optogenetic activation period). 
Furthermore there was no increase in relative stride speed following stimulation 
offset. Activation of the three A00c neurons with R71A 10 was not sufficient to 
evoke rolling, but it was sufficient to evoke a mild, but significant, increase in 
stride speed. Thus, distinct neurons downstream of Basins appear to 
differentially control distinct aspects of the ‘roll-fast crawl’ escape sequence. 
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Extended Data Figure 8 | Multilevel convergence can enhance sensitivity to 
weak multimodal stimuli. A benefit of the multilevel architecture could be to 
amplify the effect of mechanosensory cues on nociceptive cues, increasing the 
sensitivity to relatively weak bimodal cues. To explore this idea further, we used 
a simple model to ask whether a two-level network with two levels of 
convergence (multilevel convergence; MC) can be more sensitive to relatively 
weak bimodal events than a network with only early convergence (EC). 

a, Schematic for a simple model of early and multilevel convergence networks 
(see Methods for details). We considered an early convergence and a multilevel 
convergence two-layer feed-forward circuit with two inputs, corresponding to 
MD IV (Enoci) and chordotonal (Emech), and one output corresponding to 
rolling occurrence. We modelled steady-state firing rates, where the output of 
each model neuron is a sigmoidal logistic function (with a lower threshold and 
upper saturation) of a weighted sum of its inputs. One pathway remains only 
chordotonal, while the other mixes modalities only early, or at both levels, 
depending on weights (wy and ws). b-d, Solutions for wy and wz are found 
using a constrained optimization procedure that maximizes sensitivity to weak 
bimodal inputs (b, c) within two experimentally observed unimodal target 
outputs (d): (1) that mechanosensory stimulus alone never evokes rolling; and 
(2) that the strongest MD IV stimulus alone evokes only 30% output (Fig. le, f 
and see Methods for details). e, Example deviation h from target outputs (see 
Methods) for multiple values of wy, and wz for one set of thresholds (0,, = 40 
Om = 50; 03 = 75). f, Values of wy and wg that maximize sensitivity S (dots) 
while also satisfying h < 3 (grey area). g, Early and multilevel solutions (if they 
exist) for other thresholds. Solutions exist if one or both Oy or @3 is high 
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(keeping 0,, = 40). Note that although the optimal sensitivity multilevel 
convergence solution could have wx = 0 and thus effectively be an early 
convergence solution, this does not occur; multilevel convergence solutions are 
always more sensitive when they exist. Multilevel convergence solutions with 
wz = 0 are not shown since they do not exhibit multilevel convergence. 

h, Sensitivity of the optimal multilevel convergence solution as a function of the 
same thresholds as in Extended Data Fig. 8g, normalized by the most sensitive 
multilevel convergence solution found. Multilevel convergence solutions are 
more sensitive to relatively weak multimodal stimuli than early convergence 
solutions. Across all thresholds tested, the most sensitive circuits occur for 
Om= 50 and 03 = 75 (Extended Data Fig. 8k). For such parameters, no early 
convergence solutions satisfy the unimodal constraints, hence multilevel 
convergence solutions are overall the most sensitive. i, j, Example early 
convergence and multilevel convergence solutions for wz and ws that satisfy the 
condition in d for one set of neuronal firing thresholds (9, = 40; Oy = 50; 

03 = 75). k, Subtracting the output of early convergence from the multilevel 
convergence circuit. The multilevel convergence circuit triggers more rolling 
than the early convergence circuit in response to relatively weak bimodal cues 
and strong unimodal MD IV cues, but less rolling than the early convergence 
circuit to strong unimodal chordotonal cues (top left corner). The multilevel 
convergence circuit thus offers a more complex response function with greater 
sensitivity and selectivity for multimodal and strong unimodal MD IV cues, but 
not chordotonal cues, and could enable enhanced selection of rolling to more 
threatening events and reduced selection of rolling to less threatening events. 
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Structure of the human 80S ribosome 


Heena Khatter*****, Alexander G. Myasnikov''??"**, S. Kundhavai Natchiar"**** & Bruno P. Klaholz!:?3-* 


Ribosomes are translational machineries that catalyse protein synthesis. Ribosome structures from various species are 
known at the atomic level, but obtaining the structure of the human ribosome has remained a challenge; efforts to 
address this would be highly relevant with regard to human diseases. Here we report the near-atomic structure of the 
human ribosome derived from high-resolution single-particle cryo-electron microscopy and atomic model building. 
The structure has an average resolution of 3.6A, reaching 2.9 A resolution in the most stable regions. It provides 
unprecedented insights into ribosomal RNA entities and amino acid side chains, notably of the transfer RNA binding 
sites and specific molecular interactions with the exit site tRNA. It reveals atomic details of the subunit interface, which is 
seen to remodel strongly upon rotational movements of the ribosomal subunits. Furthermore, the structure paves the 
way for analysing antibiotic side effects and diseases associated with deregulated protein synthesis. 


Ribosomes form the hub of translation across all kingdoms of life, 
synthesizing proteins using an mRNA template. An interesting con- 
cept that has emerged from previous ribosome structures is its layered 
evolution, with a conserved core across all species comprising the 
catalytic peptidyl transferase centre (PTC) and GTPase- (guanosine 
triphosphatase) associated centre in the large ribosomal subunit, and 
the decoding centre in the small subunit’. The exceptionally complex 
ribosomes of multicellular eukaryotes contain additional rRNA com- 
ponents, termed expansion segments (ES), and additional protein 
moieties as compared to their prokaryotic counterparts. The human 
ribosome (80S) has a molecular weight of 4.3 MDa: the large subunit 
(60S) consists of 28S, 5S and 5.8S rRNAs and 47 proteins, while the 
small subunit (40S) possesses a single 18S rRNA chain and 33 pro- 
teins. The structures of mammalian cytoplasmic ribosomes have pro- 
vided locations and folds of all eukaryote-specific components’”’. A 
recent structure of the human ribosome at an average resolution of 
~5.4-6.0A provided useful insights’; however, structural data closer 
to 3.0A resolution are required to derive an atomic model of the 
human ribosome, the ultimate goal of eukaryotic ribosome structural 
studies. The ribosome is a target for various human diseases (notably 
infectious diseases and cancer), which gives a structure at or around 
3.0 A resolution added medical importance, especially with regard to 
the growing problems of antibiotic resistance and drug therapy side 
effects. 

Over the past year, structure determination of asymmetric objects 
such as the ribosome by single-particle cryo-electron microscopy 
(cryo-EM) has reached the 3-4 A resolution range (for example, refs 
8-11), owing largely to improved electron detectors'*""*. Nevertheless, 
the potential of cryo-EM-associated atomic model building and 
refinement procedures as used in the field of X-ray crystallography 
has not been fully exploited, especially in terms of structure refine- 
ment of large macromolecular complexes. Here we present the near- 
atomic structure of the human ribosome determined using high-reso- 
lution electron microscopy and a direct electron detector (comple- 
mentary metal oxide semiconductor (CMOS) detector with a back- 
thinned direct detection device (DDD) sensor) in combination with 
recent developments in image processing (dose fractionation and 
movie processing), 3D reconstruction and atomic model building 


using crystallography refinement procedures. The structure of the 
human 80S ribosome provides unprecedented insights into rRNAs 
and ribosomal proteins down to the level of individual residue side 
chains, inter-subunit contacts, functionally relevant key regions 
including the PTC and the decoding region, interactions with the 
tRNA present in the exit (E) site, and potential ligand-binding pockets. 
These can be further exploited for the molecular description of anti- 
biotic and other potential drug pockets. 


Structure determination 


Human 80S ribosomes from HeLa cells were prepared using a recently 
established protocol’* that yields highly homogenous samples of crys- 
tallizable quality and which contain a tRNA in the E-site; however, 
X-ray diffraction of these crystals was limited to ~25 A (ref. 15). The 
combination of highly homogenous samples with on-site advanced 
single-particle cryo-EM imaging techniques (including movie proces- 
sing done here with only 3 frames) produced a structure at near- 
atomic resolution (Fig. 1). While the average resolution is estimated 
to ~3.6 A (Extended Data Fig. 1), local resolution calculations (Fig. 2) 
show that most regions are between 2.4 and 46A resolution, with 
significant components at 2.9 A (see features in Extended Data Fig. 2), 
and only decrease to ~6 A in the more flexible regions at the peri- 
phery. An atomic model of the human ribosome was obtained by 
extensive model building and restrained parameter refinement pro- 
tocols'® (see Methods). We combined this approach with a recent 
development in the field of X-ray crystallography called feature- 
enhanced map (FEM”’), which robustly increases the interpretability 
of maps. For the first time, FEM map calculations were used in the 
context of cryo-EM maps for structure interpretation (Fig. 2c, d), 
which for the well-ordered regions of the human 80S ribosome show 
resolution and features that go beyond those of the original cryo-EM 
map. Final electron density maps were calculated by combining 
phases of the refined atomic model and Fourier coefficients derived 
from the cryo-EM map, as is routinely done in the crystallography 
field, yielding further improved maps with features clearly beyond 3 A 
resolution (Fig. 2e and Extended Data Figs 3 and 4). This observation 
indicates that useful amplitudes are present in the cryo-EM data 
beyond the average resolution estimated by Fourier-shell correlation. 
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Figure 1 | Overall structure of the human 80S ribosome. a, b, Global views of 
the 80S human ribosome from the solvent sides of the 60S and 40S subunits 
(a) and from the inter-subunit interfaces with ribosomal proteins indicated on 
the atomic model (b) (backbone representation; proteins uS14, eS21, eL13, 
eL21, uL24, eL32, eL33, eL37 and eL39 further inside the structure are not 
labelled). 


The final atomic model (Fig. 1) comprises ~220,000 atoms across the 
5,866 nucleotide residues and ~ 11,590 amino acids of the 80 proteins, 
the 4 rRNAs and E-site tRNA, and 239 Mg”~ ions. 


Description of the overall structure 

The overall structure shows the common landmarks of the 40S and 
60S ribosomal subunits (Fig. 1). Compared to the yeast ribosome’, 

the human ribosome contains an additional protein, eL28 (ref. 1). We 
observed additional densities corresponding to the carboxy terminus 
of uS2 (residues 209-222) and also redefined the globular structure of 
protein eS17 (residues 72-93) in the 40S subunit (Extended Data Fig. 
5). The present structure represents a pre-translocation state in which 
the 40S subunit is non-rotated (Fig. 3); this non-ratcheted state is 
similar to the yeast ribosome’, while the recent human and porcine 
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Figure 3 | Conformational changes of the human ribosome. a, Comparison 
of the human ribosome structure in the pre-translocation state (present study, 
pink 40S, grey 60S) and in the post-translocation state (when factor eEF2 is 
present, cyan 40S, porcine ribosome), revealing three major types of movement. 
b, ¢, Illustration of large conformational changes of protein eL24 (eB13) and 
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Figure 2 | Local resolution and electron density maps. a, b, Local resolution 
estimation of the cryo-EM map (central section) and histogram distribution 
of resolution values. c, d, Comparison of cryo-EM (c) and FEM (d) maps 
(arrows highlight regions with improved density). e, Example of electron 
density map obtained by combining amplitudes derived from the cryo-EM map 
and phases calculated from the iteratively refined atomic model (Phenix map 
calculation), illustrating the interactions of ribosomal protein uL2 with 28S 
rRNA. f, Example of the definition of side-chain residues (Phenix map) 
forming a ligand-binding pocket (here modelled with blasticidin, a nonspecific 
antibiotic) as required for structure-guided drug design purposes. 


ribosome structures exhibit essentially the same post-translocation 
conformation due to the presence of elongation factor eEF2 (refs 1, 
8). The present conformation is consistent with the presence of a 
tRNA molecule in the E-site and the movement of the L1 stalk 
towards the core of the ribosome. The L1 stalk rRNA interacts with 
the tRNA close to the elbow region, as observed earlier for the 70S 
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eL19 (eB12) occurring upon 40S subunit rotation. The C-terminal helix of eL24 
is relocated to interact with a different region of the 18S rRNA. eL19 undergoes 
a structural transition from a straight (green, post-translocation state’) to a 
bent protein helix (pink, pre-translocation state, present study). 
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ribosome”. In contrast to the previous human and porcine ribo- 
some structures, the 40S subunit head shows a swivel movement away 
from the 60S, which leads to a partial closure of the latch region (Figs 1 
and 3a). This could be related with the fact that the mRNA channel is 
empty, that is, it contains neither mRNA nor protein factors involved 
in ribosome starvation, in contrast with previous yeast and human 
ribosome structures'’. In summary, the 40S subunit undergoes three 
characteristic types of conformational changes: (1) a universal rota- 
tion similar to ‘ratcheting’ in bacterial ribosomes”’; (2) a swivel move- 
ment of the 40S head and beak regions in the direction of the tRNA 
E-site; and (3) a rotation of the 40S body around a vertical axis per- 
pendicular to the classical rotation axis termed ‘subunit rolling’ (not 
observed in bacterial ribosomes, but recently described for a eukar- 
yotic complex” Fig. 3a). The subunit rolling leads to a slight opening 
of the ribosomal subunit interface (discussed below) on the side of the 
factor entry site where ribosomal GTPases bind. In this context, it is 
also interesting to note that relatively large crevasses and gaps exist 
between the different structural domains of the 18S rRNA, which 
could serve to accommodate conformational changes of the 40S 
throughout the translation process. 

The transition from a pre- to a post-translocation state induces 
several major rearrangements in the structure, highlighted by ribo- 
somal proteins eL24 and eL19 (Fig. 3b, c), which constitute some of 
the characteristic eukaryote-specific bridges (eB12 and eB13, anno- 
tated in Fig. 4). The C-terminal part of eL24 relocates entirely and 
forms a new interaction site on the 40S subunit with ribosomal pro- 
tein eS6 and 18S rRNA (Fig. 3b). In the case of eL19, a long protein 
helix protrudes from the 60S subunit to cross over to the 40S subunit. 
This is one of the rare 60S components (apart from the L1 region) that 
strongly changes conformation upon subunit rotation. The 
C-terminal helix of eL19 is kinked in the pre-translocation state 
(and extends by another seven residues), while it is straight in the 
post-translocation state (Fig. 3c). This conformational transition 
results in formation of salt bridges between the positively charged 
Arg172 and Arg176 side chains of protein eL19 and the phosphate 
moieties of 18S rRNA nucleotides G909 and G910 (Fig. 3c). 


Inter-subunit interface 


The present structure highlights some of the characteristic inter- 
subunit contact sites in more detail with respect to the molecular 
interactions between the rRNA and protein residues involved. The 
interface comprises a series of contact regions that have been anno- 
tated (Fig. 4, summarized in Extended Data Table 1) according to the 
nomenclature used for inter-subunit bridges of the yeast ribosome” 
(extended by two new bridges eB15 and eB16’, not seen in the present 
structure). The core of the contact interface is formed by helix H69 
(28S rRNA, A3766 region; ‘h’ and ‘H’ are used to number rRNA 
helices of the small and large subunits, respectively) from the 60S 


Figure 4 | Ribosomal inter-subunit bridges. Annotation of the inter-subunit 
bridges at the interface of the 40S and 60S subunits (rRNA in light grey, 
ribosomal proteins in dark grey, colour-coded bridges are indicated). 
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subunit which interacts with helix h44 (18S rRNA, G1849 region; 
Fig. 4) to form bridges B2a and B2b, close to the universal rotation 
axis (Fig. 3a) around which the ribosomal subunits rotate forward and 
backwards during the tRNA translocation process, as seen in prokar- 
yotic and eukaryotic ribosomes*”*****. Among the known inter- 
subunit bridges, B7b/c and B8 are not observed in the present human 
80S ribosome (not including contact-mediating ions such as Os in the 
yeast 80S structure or Mg’ ), while the bridge eB8 contact is mediated 
by eS1 and eL8, in contrast with the yeast 80S ribosome’, where eL43 
replaces eL8. Furthermore, there is an additional B2e bridge present as 
observed previously”. It is remarkable that a noticeable gap exists 
between the 40S and 60S subunits in the region below the factor 
binding site, which is related possibly with the vertical rotation of 
the 40S subunit and a slight opening of the interface (Fig. 3a). In 
contrast to other contact regions, there are only few densities observed 
in the inter-subunit space, which could be ions such as Mg’*. In gen- 
eral, five contact types are observed: ribose-ribose, ribose—phosphate, 
ribose-base, protein-base and protein-protein interactions (Fig. 5). 
Below, we highlight some of the most prominent examples of inter- 
subunit contact regions in the human ribosome. 

A particularly interesting contact type are the interactions 
mediated by ribose moieties, with the 2’-OH groups of the rRNA 
backbone sugar moieties playing a key role rather than the charged 
phosphate groups of the rRNA backbone (Fig. 5a). The 2’-OH groups 
are offset, leading to a bifurcated hydrogen-bonding pattern. This 
pattern is distinct from sugar-edge (ribose-base) interactions” and 
somehow similar to that of protein B-sheets involving the carbonyl 
backbone***. This leads to a characteristic ribose-ribose interface 
involving mostly hydroxyl groups of the sugar moieties, as illustrated 
by the interactions of the 2'-OH groups of nucleotides A1719, A1815 
and G1816 (18S rRNA helix h44) with the 2'-OH groups of nucleo- 
tides G3806, A3807, C3794 and A3795 (28S rRNA helix H71; Fig. 5a). 
Similar ribose-ribose interactions exist between ribose moieties in the 
subunit interface of bacterial ribosomes*’~’. These ribose-ribose con- 
tact regions differ from other regions of the interface in which a 
significant amount of interactions occur through Mg”*-mediated 
phosphate-phosphate electrostatic interactions, while the bridges 
located more at the periphery operate through direct hydrogen bond- 
ing rather than being mediated by Mg”* (for example, uS13—uL5, 
Fig. 5, or the eukaryote-specific eL19 and the 18S rRNA 910 region; 
Fig. 3b, c). This difference could explain the observation that the 
biochemical stability of eukaryotic ribosomes is much less Mg” - 
dependent as compared to bacterial ribosomes, and much more 
dependent on K* ions, consistent with the idea that potassium is 
required for inter-subunit stability while Mg’* ensures proper 
rRNA folding’****°. Characteristic protein-RNA interactions exist, 
for example, between 18S rRNA region 1173 and the amino-terminal 
helix of protein eL41 and involve phosphate group interactions with 
positively charged residues (Arg and Lys; Fig. 5b). Other examples of 
interface remodelling are regions of the ribosomal subunit interface in 
which rRNA ribose and Mg- * -mediated contacts sites are also seen to 
switch positions upon rotation of the ribosomal subunits. This 
involves rearrangement of the interface and Mg’ * -ion repositioning 
(for example, bridges B2b, 1848 nucleotide region of 18S rRNA, and 
phosphate backbone of nucleotide 1070; solvent channels with resid- 
ual densities for more potential ions are found right below the E-site 
tRNA binding site). 

Owing to the conformational difference of the structure as com- 
pared to previous human and porcine ribosome structures, numerous 
contact sites are shifted or completely rearranged. Apart from the two 
examples of bridges eB12 and eB13 described above that involve 
major conformational changes of the involved proteins, another 
prominent example is that of bridge B1b/c (Fig. 5c). This contact is 
located between the central protuberance of the 60S and the 40S head 
and involves ribosomal proteins uL5 and uS13 (Fig. 5c). Rather than 
operating through large conformational changes of parts of the 
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Figure 5 | Molecular details of characteristic inter-subunit contacts. 

a, Detail of bridge B3 highlighting ribose-ribose and ribose—nucleotide 
interactions between 28S rRNA helix H71 and 18S rRNA helix h44, mediated 
through the 2'-OH ribose moieties. b, Characteristic protein-RNA interaction 
between 18S rRNA region 1173 and the N-terminal helix of protein eL41 
(bridge eB14) involving phosphate group interactions with positively charged 


proteins, this bridge rearranges through a protein shift that is amp- 
lified at the periphery of the 40S subunit when rotating. Upon subunit 
rotation and 40S head swivelling the molecular interactions between 
uS13 and uL5 are completely remodelled and the interacting residues 
of uS13 shift by about 23-26 A (Cx atom of Argl4 and Arg104, 
respectively). In the pre-translocation state, Ilel113 and Lys118 
(uL5) contact the side chains of Arg14 and Gln10 (uS13). By contrast, 
in the post-translocation state as observed in the previous human and 
porcine ribosome structures, this bridge is formed by Tyr119 of 
uL5 with Argl08 of uS13 (see schematic in Fig. 5c); the large shift 
allows additional interactions between uL5 and uS19 to be formed. 
Similar rearrangements have been observed in yeast and Escherichia 
coli ribosomes*?'””. 
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Figure 6 | Molecular recognition by the human ribosome at the level of the 
E-site tRNA. a, Top view of the E-site (ribosomal subunits and the L1 stalk are 
annotated). b, Comparison of the E-site tRNAs from the human and porcine 
ribosome structures (refs 1, 8; ribosome in the E/E and P/E states, anticodon 

and CCA regions are labelled). c, Details of the interactions between the CCA 
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residues. c, Details of bridge B1b/c which involves proteins uS13 and uL5 close 
to the central protuberance of the ribosome. The schematic drawing illustrates 
the remodelling of the inter-subunit interface that occurs at the atomic level 
between the pre- and post-translocation states (see also Fig. 3; present study and 
refs. 1, 8, respectively). 


tRNA binding sites and molecular recognition in E-site 


The present human 80S ribosome contains a co-purified tRNA in the 
E-site, which is well-defined in the cryo-EM map and appears to 
stabilize the L1 stalk (Fig. 6a). Comparison with other E-site tRNA 
structures shows that its conformation is different in the anti-codon 
loop regions of the decoding stem, while it is similar at the CCA end of 
the acceptor stem (Fig. 6b). The tRNA is found in the classical E/E 
state consistent with the pre-translocation state of the ribosome, while 
it is observed in a P/E hybrid state in the porcine ribosome***”” (the 
presence of tRNA at each of these sites is annotated as E/E or P/E, with 
the first symbol denoting the contact with the small subunit and the 
second referring to the large subunit). On the 60S subunit, the E-site 
tRNA interacts with the L1 stalk oriented inwards, and with the L1 
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end region of the E-site tRNA and the 28S rRNA and protein eL32 of the 60S 
subunit, showing an intercalation of base C76 in between nucleotides G4370 
and G4371, and a m-stacking of base C75 with Tyr41 of eL32. The antibiotic 
cycloheximide would clash with the CCA region of the tRNA. 
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protein at the level of the tRNA elbow. The L1 stalk rRNA interacts 
with the tRNA close to the elbow region and at the top of the decoding 
stem (Fig. 6a). The CCA end exhibits specific interactions in a ligand 
pocket, formed by G4370 and G4371 of the 28S rRNA and ribosomal 
protein eL32 (Fig. 6c). A76 of the tRNA intercalates between bases 
G4370 and G4371, while C75 forms m-stacking interactions with 
Tyr4l1 of eL32 (Fig. 6c). The existence of molecular recognition events 
in the E-site binding pocket is an interesting observation considering 
that the tRNA behaves as a reaction product, which maintains specific 
interactions with the enzyme, after the transfer of the nascent peptide 
to the new P-site tRNA during the translation process. These specific 
interactions are present although the tRNA probably is a mixture of 
co-purified tRNAs, suggesting that these interactions are common to 
all tRNA, consistent with the universal conservation of the CCA end 
sequence’. The binding pocket of the tRNA CCA end overlaps with 
the binding site of the antibiotic cycloheximide known from the yeast 
ribosome’. On the 408 side, the E-site tRNA decoding stem fits into a 
pocket formed by protein uS7 and bases G961, A1641 and C1684 of 
the 18S rRNA (Extended Data Fig. 6). Although details of the ribo- 
some interactions with the anti-codon cannot be resolved, their pres- 
ence suggests that interactions with the mRNA are partially released 
in the E-site as seen previously in 70S crystal structures”. 

While the A- and P-sites are empty in the present structure, the two 
binding pockets are precisely preformed with many structurally well- 
ordered side chains for molecular recognition of the tRNA molecules. 
Exceptions to this are A4548 (28S rRNA helix H93, part of the PTC, 
corresponds to A2602 in E. coli) and the central part of protein uL16 
that forms a loop (residues 101-117), which are essentially disordered 
in the P-site (while the C-terminal part of uL16 located in the A-site is 
fully defined; Extended Data Fig. 6). This suggests that these compo- 
nents of the P-site pocket actively accommodate the tRNA acceptor 
stem and CCA end upon binding and become structurally ordered (in 
the presence of a P-site tRNA the loop interacts with bases 1 and 2 of 
the tRNA)*. 

The structural stabilization upon substrate binding also applies to 
helix H69 on which the P-site tRNA decoding stem lodges, and to 
U4452 in the PTC. H69 shows some structural disorder, while 18S 
rRNA helix h44 with which it forms bridge B2a is well ordered. Thus, 
P-site tRNA binding is likely to reinforce inter-subunit interactions 
across this bridge through the stabilization of H69, consistent with the 
idea that P-site tRNA binding stabilizes the fully assembled 80S com- 
plex, which is required for translation initiation “**. Similarly, nuc- 
leotide U4452 of 28S rRNA at the entry of the peptide exit channel is 
disordered in the absence of growing peptide, while all side chains 
forming the peptide channel through the 60S subunit are well ordered. 
U4452 is located at a potential branching point of the peptide channel, 
suggesting that this residue could serve as a gatekeeper to correctly 
orient the early growing nascent peptide. 

Regarding the A-site, all ribosomal residues including nucleotides 
U4501, U4500 and G4499 (helix H92 of 28S rRNA) are well defined 
and pre-oriented for base-pair interactions respectively with the com- 
plementary C74, C75 and A76 nucleotides of the tRNA, with U4438 
(H89 of 28S rRNA) prepared for recognition of A73 of the tRNA. On 
the 40S subunit, the decoding centre is entirely pre-formed including 
most side chains. The universally conserved nucleotide residues A1824 
and A1825 of 18S rRNA helix h44 (A1492 and A1493 in E. coli num- 
bering) are not resolved well enough to designate their position as 
flipped-in/out, which is known to be important for decoding”. 


Outlook 


This work provides, to our knowledge, the first high-resolution struc- 
ture of the human 80S ribosome with unprecedented insights into 
rRNA and amino acid residues, including detailed side-chain positions 
and conformations. It reveals the basis of molecular recognition 
through hydrogen bonding between ribosomal RNAs and/or protein 
moieties within the entire ribosome structure, at the level of the exit site 
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tRNA and within the subunit interface, which is seen to remodel upon 
rotation of the ribosomal subunits. Comparison of the pre- and post- 
translocation states illustrates how inter-subunit bridges have a marked 
impact on the dynamics of the human ribosome and provides atomic 
details about subunit interface rearrangements. While the present ribo- 
some complex contains only an E-site tRNA for which detailed inter- 
actions are described here, its structure constitutes a reference for 
future functional complexes with ribosomal factors such as mRNA, 
tRNA and associated protein factors, either directly involved in the 
translation mechanism or implicated in general translation regulation. 
Biochemically, the present human 80S ribosome samples will be better 
suited for forming functional complexes because they do not contain 
any hibernation or starvation factors such as Stm1 (ref. 2) or serpine 1 
mRNA-binding protein 1 (SERBP1)’, located within the path of the 
mRNA in previous human and yeast ribosome structures. 

Another important aspect is that, for drug design purposes, it is 
crucial to work with the human target rather than with that of related 
model systems because of unpredictable effects such as second layer 
residues around ligand-binding pockets, which can modify the inter- 
action pattern of the primary contacting residues or open new pockets 
by conformational changes of side chains. Our structure provides a 
valuable repertoire and molecular description of antibiotic binding 
pockets (Fig. 2f) and other new potentially druggable ligand-binding 
pockets, which can serve as a basis for structure-guided computer- 
assisted drug design, in silico screening and structure determination of 
complexes with pharmaceutically relevant ligands. In this respect, we 
foresee two major applications: (1) reducing side effects of current 
antibiotics by improving drug specificity; and (2) developing new 
anticancer agents that target the human ribosome to specifically 
reduce the elevated protein synthesis rate of cancer cells that makes 
them a primary target over normally growing cells. The present high- 
resolution human ribosome structure will hopefully provide new 
opportunities in drug developments directly relevant for human 
health. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Purification. Human 80S ribosomes were prepared from HeLa cells, purified to 
homogeneity and characterized biochemically as described earlier’. Briefly, the 
HeLa cells were lysed and the cytosolic extract was then loaded onto 30% sucrose 
cushion for overnight centrifugation. The ribosomal pellet was treated with pur- 
omycin before the final step of sucrose gradient and PEG precipitation. 
Concentration of purified 80S was calculated using 1 A260 unit corresponds to 
20 pmol of 80S ribosome. 

Data collection. Freshly prepared human 80S ribosomes (2.5 pl of a 0.5 mg/ml 
solution) were applied to 300 mesh holey carbon Quantifoil 2/2 grids (Quantifoil 
Micro Tools, Jena, Germany), blotted with filter paper from both sides in the 
temperature- and humidity-controlled Vitrobot Mark IV apparatus (FEI, 
Eindhoven, Netherlands, T = 20°C, humidity 95%, blot force 5, blot time 0.5 s) 
and vitrified in liquid ethane pre-cooled by liquid nitrogen. Data were collected 
on the in-house spherical aberration (Cs) corrected Titan Krios S-FEG instru- 
ment (FEI, Eindhoven, Netherlands) operating at 300 kV acceleration voltage and 
at a nominal underfocus of Az = —0.6 to —4.5 jum using the second-generation 
back-thinned direct electron detector CMOS (Falcon II) 4,096 X 4,096 camera 
and automated data collection with EPU software (FEI, Eindhoven, Netherlands). 
The microscope was carefully aligned as well as the Cs corrector. The Falcon II 
camera was calibrated at nominal magnification of 59,000. The calibrated 
magnification on the 14 um pixel camera was 127,272X resulting in 1.1 A pixel 
size at the specimen level. The camera was set up to collect 3 frames (start from 
the second one) out of 17 possible, plus one total exposure image. Total exposure 
time was 1s with a dose of 606A * (or 3.5@A 2 per frame) (where é specifies 
electrons). 

Image processing. Before particle picking, stack alignment was performed, which 
included three frames and total exposure image (total four images in the stack). 
These four images in the stack were aligned by the whole image motion correction 
method". Thereafter, an average image of whole stack was used to pick 75,000 
particles semi automatically using EMAN2 Boxer“, and the contrast transfer 
function of every image was determined using CTFFIND3” in the RELION 
workflow”. 2D classification was used to remove bad particles (18,500) and 
particle sorting was done by 3D classification using 6 classes starting from 
56,500 particles, resulting in two dominant classes with 10,000 and 45,000 part- 
icles in rotated and non-rotated ribosome conformations, respectively (Extended 
Data Fig. 7); the subpopulation of the rotated ribosome contains elongation factor 
eEF2 and the ribosome is in a conformation similar to that of the previous lower- 
resolution human ribosome structure’. For the first steps of refinement, images 
were coarsened by 2 (C2-images) and 4 (C4-images) using RELION. The initial 
2D classification was done with the C4 images, followed by splitting of the 
different states of the ribosomes using C2 images. Only once the states were 
clearly separated, uncoarsened (C1) data was used. During the last step of refine- 
ment, only close to focus data up to defocus — 1.4 um (about 24,000 particles) was 
used. At the very last step, the ‘movie processing’ procedure as implemented in 
RELION 1.3 version was used with the 3 frames included in the calculations as 
individual frames. A post-processing procedure implemented in RELION 1.2 
(ref. 50) was applied to the final maps for appropriate masking, B-factor sharpen- 
ing and resolution validation to avoid over-fitting’’. In this procedure, the appro- 
priate B-factor was determined according to ref. 52. The resolution was estimated 
in RELION®, according to the ‘gold standard’ Fourier shell correlation 
(FSC = 0.143)°”° (Extended Data Fig. 1), indicating an average resolution of 
3.6 A. After calculating local resolution values with ResMap™ some regions are 
shown to reach ~2.4A resolution in the best cases, and up to 6 A in the more 
flexible regions. In general, the 40S subunit appears more flexible resulting in 
some less well-defined regions at lower resolution as compared to the core of the 
60S subunit, which is structurally more stable with the exception of the ES, L1 and 
P-stalk regions. Map interpretation was done using Chimera” and COOT” 
starting from the available atomic models". For this, starting atomic models of 
human and porcine ribosomes'* were placed as individual subunits, rebuilt and 


the RNA and protein sequences modified according to the human gene sequence. 
Most ribosomal proteins have about 95-100% conserved sequences in mammals 
except eL43, eL29, uL29, eL14 and eL6, which were rebuilt with the human 
sequences (ribosomal proteins follow the new nomenclature’). The atomic 
model was refined against the experimental map using Phenix”*, which included 
real space refinement, positional refinement, grouped B-factor refinement and 
simulated annealing. FEM maps were used for structure interpretation and fine- 
tuning the positions of side-chain conformations throughout the entire structure 
by manual model building, followed by simulated annealing protocols. We mon- 
itored the refinement process with R-factor values to avoid over-fitting, and 
refined the entire 80S structure at once (to ensure proper distance restraints 
between residues throughout the structure, notably at the subunit interface) by 
simulated annealing using parallel computing, rather than refining each protein 
subunit individually as for the recent porcine ribosome structure®. The R-factor 
value of the refined structure (35.7%) compares well with that of crystal struc- 
tures*® showing that the atomic model is well refined compared to crystallography 
standards. The final atomic model comprises ~220,000 atoms (excluding hydro- 
gens) across the 5,866 nucleotide residues and ~11,590 amino acids of the 80 
proteins and the 4 rRNAs (288, 5S, 5.8S and 188; excluding certain expansion 
segment rRNA which are only partially visible at the periphery of the structure 
probably due to conformational heterogeneity) and 239 Mg” ions. The structure 
extends the completeness of the recent porcine ribosome structure at the level of 
several regions, such as the position of some rRNA segments (however, most 
rRNA expansion segments remained poorly defined). This includes the region of 
the L1 rRNA stalk and the L1 protein that could be completed; albeit at lower local 
resolution, this region is rather well ordered due to the presence of a tRNA in the 
E-site, which was fully built in the atomic model. Side chains in various sites and 
pockets are largely well defined (for example, tRNA sites, PTC, decoding site, 
peptide channel, mRNA channel), unless they become ordered upon substrate 
binding (see main text); interestingly, the «-sarcin-ricin loop region (rRNA helix 
H91) is entirely ordered even in the absence of ribosomal GTPases (including the 
A4605 region, which is catalytically relevant for GTPase activation and the non 
base-paired G4600 located next to the G-domain of factors). Protein residues of 
the final atomic model show well-refined geometrical parameters (allowed 
regions 22.5%, preferred regions 67.9% and 9.5% of outliers in Ramachandran 
plots, r.m.s. bond deviations of 0.008A and angle deviations of 1.24°, which 
compare well with values observed for refined crystal structures). Figures were 
prepared using the software Chimera® and Pymol®. 
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Extended Data Figure 1 | Resolution estimation. The average resolution of the cryo-EM map as estimated from Fourier shell correlation according to the 0.143 


criterion. 
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Extended Data Figure 2 | Features of the refined cryo-EM 3D reconstruction. Final cryo-EM map of the peptidyl-transferase centre region illustrating the high 
level of details observed. 
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Extended Data Figure 3 | Typical electron density map regions of the human ribosome structure. These final electron density maps were obtained by 
combining experimental amplitudes derived from the cryo-EM map and phases calculated from the iteratively refined atomic model, as done in standard 
refinement procedures in X-ray crystallography (see main text). 
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Extended Data Figure 4 | Comparison of maps determined by cryo-EM and X-ray crystallography. Top and middle, cryo-EM map and Phenix map of the 
human 80S ribosome (this study, A4546 region); bottom, crystal structure of a 70S ribosome at 2.8 A resolution” (corresponding A2600 region). 
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Extended Data Figure 5 | Ribosomal protein structures. a, b, Structure of ribosomal proteins uS2 and eS17. 


©2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


E-tRNA 


(anti-codon region) 


18S rRNA 


b 


uL16 uL16 


°~' loop 
U4548 PTC U4548 
H93 4 


28S rRNA 


Extended Data Figure 6 | The decoding centre on the 40S. a, Contact regions _ and disorder of residue U4548 (28S rRNA) in the absence of P-site tRNA, while 
of the E-site tRNA on the 40S subunit. b, Catalytic PTC region of the ribosome, all other residues are well-ordered. This suggests that the P-site pocket is largely 
highlighting the partial structural disorder of a region of protein uL16 (loop), _ pre-defined while U4548 and the loop participate in tRNA accommodation. 
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Initial data set (75 000 particles, pixel size 1.1 A) 


2D classification (18 500 particles were removed) 


3D classification into 6 classes 


Class 1 Classes 2 &3 Classes5 &6 


Only 60S Low resolution 80S 
1,000 particles 500 particles 


10 000 particles 45 000 particles 


Ratcheted Non-ratcheted ribosome with 
ribosome with E-site tRNA 


EF2 b d 
Soe STS Extraction of low defocus 


dataset and movie processing 
(24 000 particles) 


Extended Data Figure 7 | Particle sorting scheme. Particle sorting was done _ rotated ribosome conformations respectively; the highest-resolution close-to- 
by 3D classification using six classes starting from 75,000 particles, resulting in focus data set was refined using movie processing (see Methods). 
two dominant classes with 10,000 and 45,000 particles, in rotated and non- 
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Extended Data Table 1 | Inter-subunit bridges and 60S and 40S residues involved in inter-subunit interactions 


40S subunit 60S subunit 
Bla uS19 Arg 10 28S 1766 
Blib/e uS13 Gln 10 uL5 Tle 113 
Arg 14 Lys 118 
B2a 18S 1707-1709 28S 3765-3768 
1826-1829 3759-3760 
1849-1850 3762 
1059 -1058 
B2b 18S 1051 28S 3699 
B2c 18S 1180 28S 3694-3695 
B2e 18S 938-939 eL43 | Arg 85 
B3 18S 1815-1819 28S 3805-3807 
1719-1721 3794-3796 
3628-3629 
3834 
eLAl | Arg 17 
Met 20 
B4 18S 1028-1029 28S 1563-1564 
677-678 
BS 18S 1731-1732 28S 2884-2885 
Bo 18S 1793 eL24 | Arg 47 
B7a 18S 970 28S 3710-3711 
3742 
B7b/c Not observed 
B8 Not observed 
eB8 eSl Glu 224 eL8 Glu 260 
Leu 225 Lys 264 
eB11 eS8 Arg 77 28S 5026-5027 
Asn 88 5024 
Glu 89 2893-2894 
Arg 92 3610-3611 
Asp105 3598-3599 
Gln 167 5016-5017 
Gly 169 
Lys 170 
Arg 200 
Arg 205 
eB12 18S 871 eL19 | Arg 163 
908-910 Arg 170 
914 Arg 172 
Arg 176 
eB13 eS6 Ala 33 eL24 | Lys 69 
Thr 34 Lys70 
Arg131 Arg71 
Tle 141 Thr 72 
Lys 143 Arg 73 
Leu 144 Arg 74 
Phe 145 Gln 79 
Asn 146 Arg 80 
Leu 147 Gly 84 
Ser 148 Lys 93 
Asp151 Arg 94 
Gln 155 Asn 95 
Tyr 156 Gln 96 
Lys 97 
18S 1779-1781 Arg 101 
1783 Arg 105 
1746 Glu 106 
1748-1750 Lys 117 
323 Lys 124 
327 
eB14 18S 1171-1175 eLAl | Met 1 
1182-1183 Arg 2 
1841-1847 Lys 4 
1705-1706 Arg 6 
Arg 9 
Arg 11 
Lys 14 
Arg 15 
Arg 17 
Arg 18 
Arg 21 


The contacting residues within 4A radius were selected using Contact (CCP4 suite of programs) and verified in the structure. 
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Extended hard-X-ray emission in the inner few 


parsecs of the Galaxy 


Kerstin Perez), Charles J. Hailey’, Franz E. Bauer®*°, Roman A. Krivonos®, Kaya Mori!, Frederick K. Baganoff”, 
Nicolas M. Barriére®, Steven E. Boggs®, Finn E. Christensen®, William W. Craig®?, Brian W. Grefenstette'®, Jonathan E. Grindlay", 
Fiona A. Harrison!®, Jaesub Hong", Kristin K. Madsen’, Melania Nynka’, Daniel Stern’, John A. Tomsick®, Daniel R. Wik’, 


Shuo Zhang’, William W. Zhang’ & Andreas Zoglauer® 


The Galactic Centre hosts a puzzling stellar population in its inner 
few parsecs, with a high abundance of surprisingly young, rela- 
tively massive stars bound within the deep potential well of the 
central supermassive black hole, Sagittarius A* (ref. 1). Previous 
studies suggest that the population of objects emitting soft X-rays 
(less than 10 kiloelectronvolts) within the surrounding hundreds 
of parsecs, as well as the population responsible for unresolved 
X-ray emission extending along the Galactic plane, is dominated 
by accreting white dwarf systems” >. Observations of diffuse hard- 
X-ray (more than 10 kiloelectronvolts) emission in the inner 10 
parsecs, however, have been hampered by the limited spatial reso- 
lution of previous instruments. Here we report the presence of a 
distinct hard-X-ray component within the central 4 x 8 parsecs, as 
revealed by subarcminute-resolution images in the 20-40 kiloelec- 
tronvolt range. This emission is more sharply peaked towards the 
Galactic Centre than is the surface brightness of the soft-X-ray 
population®. This could indicate a significantly more massive 
population of accreting white dwarfs, large populations of low- 
mass X-ray binaries or millisecond pulsars, or particle outflows 
interacting with the surrounding radiation field, dense molecular 
material or magnetic fields. However, all these interpretations pose 
significant challenges to our understanding of stellar evolution, 
binary formation, and cosmic-ray production in the Galactic 
Centre. 

The Galactic Centre region is dense with X-ray-emitting objects®; it 
contains the supernova remnant Sagittarius (Sgr) A East, the colliding 
stellar winds surrounding Sgr A*, the hot plasma of the Sgr A East 
plume, dozens of magnetic X-ray filaments, and thousands of resolved’ 
and unresolved point sources that constitute the Galactic ridge X-ray 
emisison***. In hard X-rays, the INTEGRAL satellite has detected 
emission centred within 1’ of the Galactic Centre’. However, the spa- 
tial resolution of INTEGRAL’s IBIS coded aperture mask (12’) has 
motivated speculation that the emission results not from a single 
object, but from a collection of the many surrounding X-ray sources”. 

The NuSTAR X-ray observatory’®, which has an effective area 
extending from 3 to 79 keV and an angular resolution of 18” (equival- 
ent to 0.7 pe at the Galactic Centre), viewed the Galactic Centre for a 
total of 281 ks in July, August and October of 2012. The image of the 
central 12 pc X 12 pc of the Galaxy (Fig. 1) in the 20-40 keV energy 
band reveals for the first time a faint diffuse emission that is peaked at 
the Galactic Centre and extends along the Galactic plane. The image is 
dominated by this feature, whose spectrum and localization within 
several parsecs of Sgr A* distinguish it from other unresolved X-ray 


emission in the Galaxy, and which has no obvious correlation with 
radio images of the dense molecular gas of the circumnuclear disk"! or 
the dust and gas of Sgr A West'”. The features prominent in soft-X-ray 
images no longer visibly contribute, with the exception of bright point- 
like emission from the pulsar wind nebula G359.95 —0.04”" and fainter 
emission from the X-ray filament G359.97—0.038" and the 
Cannonball’ neutron star. 


6.59 x 10-6 8.55 x 10° 1.05 x 10-5 


Flux (counts per pixel s~') 


4.62 x 10% 


Figure 1 | The 20-40 keV image of the inner 5’ x 5’ (12 pc X 12 pc) of the 
Galaxy. The colour scale shows flux in units of counts per pixel s_'. The image 
has been smoothed with a Gaussian kernel of width ~5” (2 pixels). The solid 
ellipse (white) illustrates the FWHM of the fit to the unresolved emission. 
Emission consistent with the pulsar wind nebula G359.95-0.04", as well as 
fainter emission from the Cannonball’’ and non-thermal filament G359.97- 
0.038", is also visible. The dashed ellipse (green) indicates the soft X-ray extent 
of Sgr A East'’. Spectra are extracted from the two thick-dashed regions (white 
polygons). 
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Figure 2 | Profiles of the 20-40 keV data and spatial model along Galactic 
longitude and latitude. a, Galactic latitude; b, Galactic longitude. Top panels, 
the data (flux in units of counts s! arcsec’, red data points; error bars, 1a) are 
fitted to a model consisting of an ellipsoidal Gaussian (dashed line), which 
describes the extended emission, and a point-like Gaussian (thin solid line), 
which describes G359.95 —0.04, both convolved with the on-axis NuSTAR PSF, 
as well as a background (dash-dot line) that varies between detector chips. 
Profiles are integrated from 25” on either side of each axis, with the origin 
defined at Sagittarius A*. The combined model (thick solid line) and residual 
emission (lower panel) show that this model describes the data well. 
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The spatial distribution of this new feature can be described by an 
elipsoudal Gaussian with a full-width at half-maximum (FWHM) of 

7.8*¢9 pc along the Galactic plane and 3.91} pc out of the plane 
(Fig. 2), where the distance to the Galactic Centre is set to be 8 kpc, the 
physical extent is quoted before convolution with the NuSTAR point 
spread function (PSF), and errors are 3 confidence limits. The cen- 
troid of this emission is compatible with the Galactic Centre, defined as 
the radio location of Sgr A* (right ascension 17 h 45 min 40.036 s, 
declination —29° 00’ 28.17”; J2000.0)'°, at the 2o confidence level. As 
the emission is centred ~1’ away from the centre of the Sgr A East 
thermal emission and extends asymmetrically >30” beyond the super- 
nova shell to the southwest, a hard-X-ray component of Sgr A East is 
ruled out as the origin. 

We performed spectral analysis in two regions at radii r ~ 1’-2' to 
the southwest and northeast of Sgr A* (see Fig. 1), chosen such that 
<4% of the emission from G359.95—0.04 contaminates each region 
(radii are measured from the radio location of Sgr A*). Since the 
NuSTAR spectral resolution” is insufficient to fully resolve the neutral, 
He-like, and H-like Fe emission lines, we used 2-10 keV XMM- 
Newton data from these same regions to constrain the temperatures 
of the plasma emission below 20 keV. 

The soft spectrum of the southwest region is dominated by point 
sources and truly diffuse emission (Fig. 3). Modelling this as two 
collisional-ionized plasmas'’ yields the best-fit temperatures 
kT, =1. oto3 3 keV and kT, =7.5* i e 3 keV, with a neutral Fe equivalent 
width of 126 +46 eV at 6.4 keV (90% confidence limits). These values 
are consistent with those previously obtained by Chandra analysis of 
faint point sources’ in the inner 9’ and diffuse emission’® in the inner 
r =~ 2'-17', as well as XMM-Newton analysis’ of the regions r = 2'-20’. 
The kT ~ 1.0 keV plasma is attributed to supernova heating of the 
interstellar medium, coronally active stars, and accreting non-magnetic 
white dwarfs*. The kT ~ 7.5 keV plasma is consistent with emission 
from resolved and unresolved accreting magnetic white dwarfs*** with 
mean white dwarf mass Mwp ~ 0.5Mo (Mo is the solar mass). The 
stellar-mass (M) normalized luminosity (L) of our thermal components 
is L(2-10 keV)/M ~ 9X10?” erg s | M3! at r ~ 3 pc, which is con- 
sistent with that measured by XMM-Newton’ at r ~ 4 pc. We therefore 
conclude that our kT, and kT> components describe the same emission 
established by previous X-ray observations. 
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Figure 3 | Unfolded broadband X-ray spectrum of the southwest region. 
Top panel, the 2-10 keV spectrum consists of XMM-Newton data from the pn 
(black), MOS1 (red) and MOS2 (green) instruments. Plotted on the y axis is flux 
in units of counts s_' keV! cm’. The 10-40 keV spectrum consists of 
NuSTAR focal plane A (dark blue) and B (cyan) data. Error bars are 1o. Above 


20 keV, the extended emission dominates, described well by a kT > 35 keV 
bremsstrahlung (here) or a photon index of J’ = 1.3-1.8 (Extended Data Fig. 4). 
Dashed lines illustrate separate model components. Bottom panel, fit residuals. 
Full spectral parameters are given in Extended Data Tables 2 and 3. 


30 APRIL 2015 | VOL 520 | NATURE | 647 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Above 20 keV, this thermal emission becomes negligible, and the 
spectrum is dominated by hard emission from our new feature. This 
emission can be described by either a power law with photon index 
Pa 5493 and flux Fyower-law(20-40 keV) = 6.7 X 10° ergs? 
cm” or a poorly constrained high-temperature bremsstrahlung at 
kThigh = = 58733 33 keV and flux Fy-r-nigh(20-40 keV) = 6.8 X 10°"? erg 
scm It is critical to note that this hard spectrum cannot be 
accounted for by extrapolating the previously detected soft thermal 
emission attributed to accreting magnetic white dwarfs. 

The X-ray energy Ex > 20 keV spectrum and flux of the northeast 
region are consistent with those observed in the southwest. The north- 
east contains significant soft emission from Sgr A East, which dom- 
inates over the point source and diffuse populations. Sgr A East is 
modelled as a two-temperature collisional-ionized plasma, yielding 
temperatures kT; =1.17$5 keV and kT? =5.178% keV, and metalli- 
city Z=2.3*05 Zo (where Zo is the solar metallicity), which are in 
agreement with Chandra and XMM-Newton measurements’’””. 
Again, this soft thermal emission does not contribute significantly 
above 20 keV, where the spectrum is fitted well by either a power 
law with [=1.67$3 and flux F,ower-taw(20-40 keV) = 7.6 X 10° 
erg s | cm” or a high-temperature bremsstrahlung with 
kT ign = 66" 35° keV and flux Fer-nigh(20-40 keV) = 7.4 X 10 * erg 
s ‘cm ~. Faint contaminating emission from the non-thermal X-ray 
filament G359.97-0.038"* accounts for the 14% difference in flux 
between the northeast and southwest regions. 

The uniform 20-40 keV spectral and flux values in both regions 
support the conclusion that the feature is symmetric along the Galactic 
plane around Sgr A* and is either thermal with kTyign > 35 keV or 
non-thermal with a photon index of [’ = 1.2-1.9 (90% confidence 
limits). Using the two-dimensional Gaussian spatial intensity model to 
scale the flux from the southwest region indicates a luminosity of 
L(20-40 keV) = 2.4 X 10° ergs‘ within the 4 pc X 8 pc FWHM 
of emission. This is <10% of the magnitude of the 20-40 keV lumin- 
osity of the INTEGRAL Galactic Centre source. 

The measured spectrum, luminosity and spatial profile constrain 
the possible origins of this emission, even given the broad parameter 
range of our phenomenological model. Truly diffuse sources, such as 
synchrotron radiation from magnetic filaments’ or low-surface- 
brightness pulsar wind nebulae”, or bremsstrahlung and inverse 
Compton emission from Sgr A* particle outflows (cosmic rays) inter- 
acting with dense molecular material, could produce the observed 
spectrum. This interpretation is challenging, though, as there is no 
spatial correlation with corresponding radio images” or radiation 
density models”, and our current estimates of magnetic filaments 
and supernova birth rates in this region cannot reproduce the observed 
luminosity (as described in Methods). 

A natural explanation for our emission is provided by the inter- 
mediate polar (a type of cataclysmic variable binary star), which has 
the hardest spectrum of all accreting magnetic white dwarfs. A mean 
kT > 35 keV implies a mean white dwarf mass** Mwp > 0.9Mo. This 
is se cv more massive than the population with mean 
Mwp ~ 0.5Mg previously observed in the Galactic Centre** and 
vidoe , or the Mwp =0.661}}?Mo population observed in the 
Galactic bulge®. It is further distinguished from the low-mass inter- 
mediate polar population by its surface brightness distribution, which 
falls more steeply with distance from the Galactic Centre. Refitting our 
20-40 keV image in the region r > 60” (to avoid biasing the distri- 
bution owing to the presence of G359.95—0.04) shows a decrease with 
angular offset from Sgr A*, 0, and latitudinal angular offset from Sgr 
A*, @, of 0 “exp(—|$1/Psc)s with « = 1.4 + 0.1 and latitudinal scale 
height ¢,. = 1.0’ + 0.1’ (lo errors). This is significantly narrower than 
the ~=0.6400? and $,.=18.6'*|'$ measured for the 2-10 keV dis- 
tribution’ at r > 2’. 

Our observed luminosity implies that about 10°-10* of these mas- 
sive intermediate polars would exist in the central 4 pc X 8 pc. Scaling 
the density of intermediate polars in the solar neighbourhood by the 
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stellar density of the Galactic Centre” and the fraction of sufficiently 
massive progenitor B-stars predicts about three orders of magnitude 
fewer systems. Though puzzling, this overabundance is of similar 
magnitude to that implied for the low-mass population’. 

A significant contribution from low-mass X-ray binaries or from 
millisecond pulsars may also be possible. The stellar density of faint 
X-ray transients is already known to be higher in the Galactic 
Centre**’’, However, for a population of low-mass black hole” 
neutron star”? X-ray binaries to account for the observed luminosity 
while remaining consistent with SWIFT X-ray outburst monitoring 
results’®, it must be composed of a rare class of very faint X-ray tran- 
sients*°”*. A sufficient population of millisecond pulsars would account 
for ~40% of the sources identified by Chandra. This in contrast to 
previous analyses identifying accreting magnetic white dwarfs as the 
majority**”, but could indicate a surprising abundance of unidentified 
millisecond pulsars in the region. This could strengthen the motivation 
for a millisecond pulsar explanation of the y-ray excess observed by the 
Fermi y-ray space telescope surrounding the Galactic Centre”. 

Regardless of origin, these observations reveal novel processes pre- 
sent within the inner parsecs of the Galaxy. If arising from a population 
of compact objects, the steeply falling spatial distribution suggests that 
binary systems are accumulating, forming, and/or experiencing accre- 
tion with a much higher probability in the inner parsecs than in the 
solar neighbourhood or the Galactic bulge. These observations thus 
provide input useful for studying accretion physics, dynamical forma- 
tion, and evolution of exotic binaries near the central supermassive 
black hole. A cosmic-ray origin, in contrast, has implications for mod- 
els of particle outflow from Sgr A*, as well as radiation and magnetic 
field distributions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


We use data from three NuSTAR observations of the Sgr A* region in 2012: 20-23 
July (154 ks), 4-6 August (77 ks), and 16-18 October (50 ks). These correspond to 
the observation IDs 30001002001, 30001002003, and 30001002004, respectively. 
Data reduction and spectral extraction were performed with the NuSTAR Data 
Analysis Software (NuSTARDAS) v1.1.0 software package. The HEASoft v6.13 
package was used for data analysis. 

Event selection. We removed all data from within 100 s of passage through the 
South Atlantic Anomaly and time periods corresponding to Sgr A* flares, as 
observed by NuSTAR or during coincident Chandra observations’’. Data from 
focal plane B of the July and October observations are not included due to stray 
light'® from the nearby bright objects GX 3+1 and 1E1740-2942. After data 
screening and combining available focal plane A (FPMA) and B (FPMB) observa- 
tions, the analysis contains ~297 ks of effective exposure time. 

Image analysis. The nominal NuSTAR image coordinates are accurate to 8” (90% 
confidence level)’*. To improve our positional accuracy, we derived an astrometric 
correction for each observation by registering known sources. For the July and 
October observations, we used the radio position of Sgr A*’®, the NuSTAR posi- 
tion of the Sgr A* flare in the 3-79 keV band, the Chandra position of Sgr A-E”, 
and the NuSTAR position of Sgr A-E”' in the 3-10 keV band for source registra- 
tion. For the August observation, where the lack ofa flare makes localization of Sgr 
A* difficult with NuSTAR, we instead used the Chandra position of the 
Cannonball’’, the NuSTAR position of the Cannonball’* in the 10-40 keV band, 
the Chandra position of Sgr A-E, and the NuSTAR position of Sgr A-E in the 3-10 
keV band. 

After each photon event file was position corrected, we combined NuSTAR 
images in sky coordinates and normalized the resulting image by an effective 
exposure map that accounts for observation time, but not vignetting effects. 
Since the optical axis was near Sgr A* for all observations and detector background 
rates in the 20-40 keV energy band are non-negligible, vignetting was accounted 
for in the spatial fitting by convolving the source components, but not detector 
background components, with the on-axis NuSTAR point spread function (PSF). 
We defined a systematic error on the astrometric correction to be the maximum 
distance between the reference positions and the NuSTAR centroid positions of 
the Cannonball and Sgr A-E knot in the resulting image, yielding uncertainties of 
1.0” in RA and 1.2” in Dec. This is in addition to the 4” spatial uncertainty from the 
PSF smearing effect due to incomplete aspect reconstruction”. 

We used the Sherpa’ package to fit a phenomenological model to the 20-40 
keV raw photon count sky image. Due to low source counts, the C-statistic was 
used. This model was composed of one symmetric point-like Gaussian and one 
ellipsoidal extended Gaussian source, both convolved with the on-axis NuSTAR 
PSF stored in the NuSTARDAS v1.1.0 CALDB database, and a flat internal back- 
ground that varies between detector chips. The background map was derived using 
blank sky observations to fix the relative normalization of the detector background 
between each chip. In this energy range, the stray light background due to the 
cosmic X-ray background and Galactic ridge X-ray emission are subdominant 
compared to the internal background. The overall normalization of the back- 
ground map was left as an independent fit parameter. The total effective exposure 
map was used to normalize all three fit components. The final model used for 
fitting was thus: 


Psf x (gauss2d.sourceOne + gauss2d.sourceTwo) * emap 


+bkgdmap * emap 


where psf is the on-axis NuSTAR PSF, gauss2d.sourceOne is the point-like 
Gaussian, gauss2d.sourceTwo is the ellipsoidal extended Gaussian, bkgdmap is 
the background map, and emap is the effective exposure map. Here, * indicates 
multiplication and X indicates convolution. To minimize variations in back- 
ground and detector response across different detector chips, and to avoid emis- 
sion from the molecular cloud regions to the northeast, the fit was performed 
within a radius r < 3’ from the Galactic Centre. 

The 20-40keV model and residuals in the fit region are shown in Extended Data 
Fig. 1. No remaining significant structure is present in the residual map. The 
component gauss2d.sourceOne fits to a FWHM of 1.8*33", where 2.5” is one 
image pixel. This is a lower limit on the spatial extent, consistent with a source 
small compared to the 18” (FWHM) NuSTAR PSF. The second component fits to 
a Gaussian with FWHM = 195.8*?°" along the major axis, FWHM = 101.8 4"" 
along the minor axis, and an inclination of 0 = 57° from the positive northern axis. 
The centroid of this component is located at ARA=—2.2+43" and 
ADec.=2.8131" from Sgr A*. Because calculating fit errors while leaving all 
parameters free was too computationally intensive, the amplitude of the back- 
ground map and the extended component ellipticity and 0 were fixed during error 
calculation. All fit parameters are listed in Extended Data Table 1. Consistent 


values of centroid location and spatial extent are obtained by fitting a single 
ellipsoidal Gaussian and background model in a region that excludes the central 
60” radius, thus minimizing emission from G359.95-0.04. 

To compare the NuSTAR 20-40 keV morphology with the 2-10 keV surface 
brightness distribution, we re-fitted the NuSTAR data with the spatial model used 
by the Heard andWarwick analysis of XMM-Newton data in the region r = 2’-20': 


S=N0 “exp(—|@|/@,.) + const. (1) 


This model describes a surface brightness S falling as a function of the angular 
offset 0 (measured in arcminutes) from the position of Sgr A* and the latitude 
angle ¢, measured with respect to the latitude of Sgr A*. The parameters N, « and 
sc represent the normalization of the power law, the power-law index, and the 
latitudinal scale height in arcminutes, respectively. 

To avoid biasing the steepness of the distribution, we excluded the central r << 
60” region that contains the bright pulsar wind nebula G359.95—0.04. We also 
exclude the emission from r < 20” surrounding the Cannonball’. The constant 
background term const. is obtained by fitting the same background map as used in 
our two-Gaussian phenomenological model to the region r = 300"--400", yielding 
the same background normalization as obtained in the two-Gaussian model 
above. We note that this const. term does not represent the true underlying sky 
surface brightness, as it contains a large contribution from instrumental back- 
ground. Fitting in the region r = 60"-180" constrains the spatial parameters to be « 
= 14+ 0.land ¢,. = 1.0’ + 0.1' (1¢ errors). This is significantly narrower in both 
longitude and latitude than the soft distribution observed by Heard and Warwick, 
who obtained «=0.6%9°3 and ¢,.=18.61!5. The effect of vignetting is not 
important this close to the optical axis. Re-fitting the data using an exposure 
map that includes the effect of vignetting, calculated at 25 keV, yields consistent 
values of « = 1.2 + 0.l and ¢,. = 1.0’ + 0.1’. 

In order to fit over the full r << 180” region and to further compare with our two- 
Gaussian model, we fix the magnitude and position of the point-like Gaussian 
describing G359.95—0.04 to the same values as listed in Extended Data Table 1. 
Re-fitting the distribution S in this full region yields « = 1.0’ + 0.1’ and $.¢ = 1.1’ 
+ 0.1’. The profiles of the total 20-40 keV emission along Galactic longitude and 
latitude, along with this best-fit model result, are shown in Extended Data Fig. 2. 
Spectral analysis. Using the event files with astrometric corrections applied, we 
extracted source spectra from the dashed regions shown in Fig. 1. Since the 20-40 
keV NuSTAR image is dominated by internal detector background, the back- 
ground spectra were extracted from regions located on the same detector chip 
that contains the Sgr A* region, but outside the molecular cloud regions (see 
Extended Data Fig. 3). 

We used 2-10 keV data from the XMM-Newton observations carried out on 
2012 August 31 (observation ID 0694640301) and 2012 September 24 (obser- 
vation ID 0694641101), reprocessed with SAS v12.0.1. No bright X-ray outbursts 
occurred during these periods. We extracted EPIC-pn and EPIC-MOS spectra 
from the regions used for the NuSTAR analysis and followed the standard data 
screening procedures from the September 2013 XMM-Newton ABC guide. 

The resulting data from the MOS1, MOS2, and pn instruments sum to ~210 ks 
of effective exposure time. For MOS1, the 2012 September 24 observation fell on a 
dead detector chip and is not used. Since the XMM-Newton spectra are dominated 
by soft diffuse emission that fills the central arcminutes, the background spectra 
were extracted from dimmer regions located at least 10’ to the northwest of the 
Sagittarius A* region. 

Spectral fitting and flux derivations were performed in XSPEC version 12.8.0°, 
with photoionization cross-sections as defined in ref. 35 and abundances for the 
interstellar absorption as defined in ref. 36. All spectra were re-binned so that each 
spectral bin contained at least 20 counts above the estimated background level, 
obtained by scaling the background spectrum to the source extraction area. 

To analyse the southwest and northeast region spectra of XMM-Newton and 
NuSTAR, we compared the models const.*tbabs*(apec + apec + gaus + po) and 
const.*tbabs*(apec + apec + gaus + bremss), where const. is a constant factor, 
tbabs is the neutral hydrogen absorption model, apec is a collisionally ionized 
plasma model, gaus is a Gaussian emission line model centred at 6.4 keV with 
width fixed to ¢ = 0.01 keV (to model neutral Fe emission), po is a power-law 
model, and bremss is a thermal bremsstrahlung model. The final spectral fit values 
are not significantly sensitive to the width of the Gaussian fixed at 6.4 keV. The 
thermal bremsstrahlung model was chosen to describe possible very-high-tem- 
perature (>30 keV) emission, where emission lines would no longer be present. To 
account for different normalizations between instruments, the constant factor was 
allowed to fit freely to each data set. 

The best-fit parameters and 90% confidence level statistical errors are listed in 
Extended Data Tables 2 and 3. In the southwest region, the lower-temperature 
component has a best-fit metallicity that is very high, Z = 5Z., but varying this 
parameter has no effect on the spectral parameters of the higher-energy compo- 
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nents. The higher-temperature component fits to a metallicity Z = 1.7Z. This is 
consistent with emission from unresolved magnetic accreting white dwarfs, with 
slightly enhanced metallicity because the lighter elemental abundances, which are 
measured to be Z > Zo in the Galactic Centre’’, are linked to the Fe abundance in 
the apec model. Because a degeneracy exists between the values for the abundance 
of the kT, component, the normalization of the kT, component, and the normal- 
ization of the power-law component, the kT> abundance was fixed to its best-fit 
value during the calculation of parameter error intervals. The hard component can 
be modelled equally well by either a thermal bremsstrahlung, as shown in Fig. 3, or 
a power law, as shown in Extended Data Fig. 4. In the northeast region, the hard 
emission can be described by these same models, as shown in Extended Data Figs 5 
and 6. The non-thermal spectrum of G359.97—0.038" could contribute at most 
19% of the flux observed in the northeast region (at the 90% confidence level). 
Spectral analysis of central r = 20". Our spatial model and the asymmetric surface 
brightness distribution indicate that our observation cannot be attributed to PSF 
leakage from a bright, central source. To further support this claim, we perform 
spectral analysis of the r = 20" region surrounding the point-like source attributed 
to G359.95—0.04. The radius was chosen to maximize the ratio of the point-source 
flux to the underlying extended flux. The software versions, background regions, 
and spectral re-binning are identical to those used above. 

We first fit the NuSTAR data in the energy range 20-60 keV with a power-law 

model, po, yielding / =2.2*). This is significantly softer than the emission 
measured in the southwest or northeast regions. To better constraint the photon 
index, we again combine the 2-10 keV XMM-Newton data with 10-60 keV 
NuSTAR data. The soft emission is dominated by Sgr A East, with best-fit tem- 
peratures and abundances consistent with Chandra and XMM-Newton measure- 
ments of this region’®”°. The hard emission is described by [ =2.0*?3, again 
significantly softer than the best-fit values of the extended emission. 
Population analysis. Truly diffuse sources. Chandra has identified 17 X-ray-emit- 
ting non-thermal filaments in the central 20’ X 20’ region”, with photon indices 
ranging within J” ~ 1.0-2.0. Extrapolating the combined 2-10 keV luminosity of 
the six filaments (excluding G359.95—0.04) that lie within the central 4 pc X 8 pe 
region using our best-fit [= 1.6, we see that they could contribute at most 30% of 
the observed luminosity. Also, no X-ray filaments are identified in our southwest 
region. The faintest filament observed by Chandra has a luminosity of L(2-10 keV) 
= 2.0 X 10° ergs’. Interpreting this as a minimum Chandra detection threshold 
and again extrapolating using the best-fit photon index implies that ~100 addi- 
tional filaments would be necessary to account for the total luminosity. Radio 
emission, which is observed from ~25% of X-ray filaments*’, would thus be 
expected to trace our observed morphology. However, no such similar structure 
is observed in recent 6 cm radio maps’. 

A population of low-surface-brightness pulsar wind nebulae** could also con- 
tribute. Again assuming L(2-10 keV) = 2.0 X 10° erg s | is the minimum 
luminosity at which Chandra would detect an extended object, ~100 such objects 
are required. However, simulations indicate that only a few dozen should exist, 
spread over the central r = 20’ of the Galaxy”. Thus an order of magnitude higher 
supernova birth rate in the Galactic Centre than currently assumed would be 
required to explain our observations. 

In addition to particle outflows from Sgr A*, cosmic rays from shocked stellar 
winds combined with the high radiation density of the region could produce an 
inverse Compton origin. To account for the observed 20-40 keV luminosity of the 
southwest region, which is at a distance of r ~ 2-5 pc, a cosmic-ray density similar 
to that of the central r< 1 pcand a radiation density of ~10,000 eV cm * would be 
required*®. However, the radiation density decreases rapidly with distance from 
Sgr A**, falling from ~5,000 eV cm? in the inner parsec to ~ 50 eV cm ?ata 
distance of 10 pc, and the cosmic-ray density is also expected to decrease rapidly 
away from the Galactic Centre. 

A more exotic origin of electrons could be the annihilation of dark-matter 
particles. However, the electron energies required to produce the observed spec- 
trum are too large to be produced by even the heaviest dark-matter candidates”. 
The NuSTAR emission is also much more centrally localized and asymmetric than 
would be predicted for dark-matter annihilation in the Galactic halo*’. 
Intermediate polars. Our estimate of the necessary number of intermediate polars 
to reproduce the observed spectrum is based on assumed* 2-10 keV luminosities 
ranging from Lyin ~ 10°°-107! erg s | to Lmax ~ 10°° erg s ', with an integral 
source luminosity distribution N(>L) « L~” and spectral index « ~ 1-1.5. 

The bremsstrahlung model we have used in our spectral fitting approximates 
the spatially dependent temperature of the cooling accretion column of an inter- 
mediate polar by an average colour temperature. To directly determine the mean 
mass limit implied by our observations, we refit the spectra of the southwest and 
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northeast regions using the model of ref. 24. Since our soft components agree with 
previously measured values, we constrain these parameters to stay within their 
90% confidence intervals. This yields a 90% confidence level lower limit on the 
mean white dwarf mass of Mwp > 0.9Mo. As with the high-temperature brems- 
strahlung model, the upper limit is not well constrained. 

Low-mass X-ray binaries. Our knowledge of the luminosity of quiescent black hole 
low-mass X-ray binaries is limited, based on a sample of only 17 systems’. For 
typical luminosities of L(2-10 keV) ~ (2-4) X 10°! erg s |, ~10° such systems 
would be required inside the central 4 pc X 8 pc to account for our observed 
luminosity. In the last decade, X-ray monitoring surveys**” are believed to have 
uncovered virtually all transient systems within the inner 50 pc with recurrence 
times of less than ~5-10 yr, outburst durations longer than a few days, and 
outburst luminosities L(2-10 keV) >~10** ergs '. Assuming a black hole binary 
outburst recurrence time of T, ~ 50-100 yr and durations ranging from weeks to 
months” and using the integrated Swift monitoring time implies ~ 12-50 tran- 
sient events should have been detected from an underlying population of ~10° 
black hole systems. However, only two new transient sources that could plausibly 
be attributed to low-mass black hole systems have been observed in that time. A 
large population of neutron star low-mass X-ray binaries is also inconsistent with 
X-ray monitoring surveys, assuming a mean L(2-10 keV) ~ 10° erg s_' and 
recurrence times”’ of ~5-10 yr. 

Millisecond pulsars. Some millisecond pulsars exhibit non-thermal emission with a 
typical photon index J’ ~ 1-2 (ref. 44), consistent with our observed spectrum. To 
study the X-ray luminosities of these objects, we selected 125 millisecond pulsars 
from the Australia Telescope National Facility Pulsar Catalog** with spin periods 
and dipole magnetic field strengths consistent with the observed hard-X-ray emit- 
ting millisecond pulsars, P< 10 ms and B < 10"! G. The spin-down power of the 
selected objects, E, ranges from 4 X 10°” to2 X 10*° ergs | with a mean E = 6 X 
10° ergs‘. Using the relation between non-thermal luminosity and spin-down 
power established by previous studies**, L(2-10 keV) ~ 10 **, we obtain a mean 
non-thermal X-ray luminosity of L(2-10 keV) = 6 X 10*° ergs” '. Approximately 
3,000 millisecond pulsars with this mean X-ray luminosity would be required to 
account for the our observations. Over 100 of these sources would have a lumin- 
osity of L(2-10 keV) >~ 4 X 10°! erg s_', which is the Chandra point-source 
detection limit for this region’. This implies that over 40% of the sources identified 
by Chandra within the central 4 pc X 8 pc are millisecond pulsars, a claim that 
requires further study and is inconsistent with the intermediate polar interpreta- 
tion of the majority of these sources”*’. 

Sample size. No statistical methods were used to predetermine sample size. 
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Extended Data Figure 1 | The best-fit 20-40 keV spatial model and residual __ chosen to highlight the extended emission, in units of total counts per pixel (bar 


image. a, The model, consisting of a point-like Gaussian, an extended at bottom). b, The residual image (observed minus model). Insets in a and 
Gaussian, and a background model as described in the main text. The radio b showa magnified view of the central 40” x 40” of the main panels, with lo, 20 
position of Sgr A*'* and the Chandra positions of the Cannonball”, and 3c error contours on the centroid positions of the extended (magenta) and 


G359.95—0.04"° and G359.97— 0.038” are overlaid. The FWHM extent of the _ point-source (green) two-dimensional Gaussian models, as well as the position 
two-dimensional Gaussian fit to the extended emission is indicated by the solid of Sgr A* (cross) and G359.95—0.04 (polygon). 
ellipse. The image is shown with a linear grey-scale distribution with the range 
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Extended Data Figure 2 | Profiles of the 20-40 keV data and exponential 
spatial model along Galactic longitude and latitude. a, Galactic latitude; 

b, Galactic longitude. Top panels, the data (red, 1o errors) are fitted to a two- 
dimensional model consisting of the surface brightness model used in ref. 5 and 
described in the main text (dashed line) and a point-like Gaussian source (thin 
solid line) that describes the emission from G359.95-0.04, both convolved with 
the on-axis NuSTAR PSF, as well as a flat background (dash-dot line) that varies 
between detector chips. Profiles are obtained by integrating from 25” on either 
side of each axis, with the origin defined at the position of Sgr A*. The 
combined model (thick solid line) and residual emission (lower panels) show 
that this phenomenological model describes the data well. 
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Extended Data Figure 3 | Regions used for extraction of background spectra. NuSTAR 20-40 keV image. The colour scale shows units of total counts per 
for NuSTAR observations 30001002001 (red), 30001002003 (white), and pixel. The x and y axes indicate coordinates of right ascension and declination, 
30001002004 (green). The regions used for spectral analysis of the diffuse respectively. 

emission are indicated by the dashed lines. These regions are overlaid on the 
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Extended Data Figure 4 | Unfolded 2-40 keV energy spectrum from the 
southwest region indicated in Fig. 1. Top panel, the 2-10 keV spectrum is 
constructed from XMM-Newton data from the PN (black), MOS1 (red), and 
MOS2 (green) instruments. The 10-40 keV spectrum is constructed from 
NuSTAR focal plane A (dark blue) and focal plane B (cyan) data. The fit to the 
data comprises two absorbed thermal plasmas plus an absorbed power-law, 


multiplied by a separate normalization factor for each data set to account for 
small calibration differences between instruments. The dashed lines indicate 
the separate model components. Fit residuals are shown in the lower panel. Full 


parameters of the spectral model are given in Extended Data Table 2. Error bars, 
1o statistical errors. 
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Extended Data Figure 5 | Unfolded 2-40 keV energy spectrum from the normalization factor for each data set. Fit residuals are shown in the lower 
northeast region indicated in Fig. 1. Top panel, the spectrum is composed as _ panel. Full parameters of the spectral model are given in Extended Data Table 2. 
in Extended Data Fig. 4. The model fitted to the data comprises an absorbed Error bars, 1o statistical errors. 

two-temperature thermal plasma plus a power-law, multiplied by a separate 
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Extended Data Figure 6 | Unfolded 2-40 keV energy spectrum from the separate normalization factor for each data set. Fit residuals are shown in the 
northeast region indicated in Fig. 1. The spectrum is composed as in lower panel. Full parameters of the spectral model are given in Extended Data 
Extended Data Fig. 4. The model fitted to the data comprises an absorbed two- _ Table 3. Error bars, 1o statistical errors. 

temperature thermal plasma plus a thermal bremsstrahlung, multiplied by a 
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Extended Data Table 1 | Best fit 2D Gaussian models 


Parameter G359.95-0.04 CHXE 
R.A. (Center, J2000) 266.4150°%%" —-266.4172° FS 
DEC (Center, J2000) -29,007245°73" —_-29.00716° 58." 
FWHM [arcseconds] 1.8173 195.8777 
Amplitude [10% cts s~*] Lr ae 0.006470 7 
Ellipticity 0.52 

6? [degree] - 57 


Parameters of the best-fit two-dimensional (2D) Gaussian models describing the extended hard-X-ray emission and the central point-like source, which is spectrally and spatially consistent with the pulsar wind 


nebula G359.95—0.04. CHXE is the 20-40 keV extended central hard X-ray emission. The errors quoted are 3c statistical errors. All parameters refer to source models before convolution with the NuSTAR PSF. 
*The angle 0 is defined with respect to the positive northern axis. 
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Extended Data Table 2 | Fits with hard emission modelled as power law 


Parameter Southwest Northeast 
PN norm.? tees Tate 
MOS1 and MOS2 norm. 1.0285 TAS 
NuSTAR FPMA norm. (fixed) 1.0 1.0 
NuSTAR FPMB norm. 12 epee 
Ny [107? cm~*] 14.1 +33 16.4 +h2 
i seas 16ers 
Nr [10~* photons cm~? s~? keV~1] i pa 16 285 
kT, [keV] 10.04 Tells 
Nr, [10~* photons cm~? s~! keV~? 10.8 Th 89.9 +997 
Zi[Zo]? BOs. 2.3108 
kT» [keV] Zo tts 5a oe 
Nr» [10~* photons cm~? s~* keV~? g.2tit 175-27! 
2Z[Zol° 1.7 2.370% 
Fe K-a eq. width [eV] jasts 47te 
x7/d.0.F. 1.00 (503.4/503) 1.05 (807.1/770) 
Fx (20-40 keV) [107° ergs cm~? s~1]¢ 7.6 8.0 


Spectral model of the two extended emission regions in the energy range 2-40 keV, obtained from a joint fit of XMM (2-10 keV) and NuSTAR (10-40 keV) data, with the hard emission modelled as a power law. All 
quoted errors are at 90% confidence level (CL). Ny, absorption column; N;, Nuri, Ner2 are the normalizations for the power-law, kT, and kT2 components, respectively. “Relative normalizations between different 


instruments, defined with respect to NUSTAR FPMA. ’Abundance relative to solar. These are best-fit values. In the southwest, Z> was then fixed during error calculations, as described in the text. “Metallicities are 
independent (linked) for the two temperature components in the southwest (northeast). “Observed flux. 
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Extended Data Table 3 | Fits with hard emission modelled as thermal bremsstrahlung 


Parameter Southwest Northeast 
PN norm.? 1 a diene 
MOS1 and MOS2 norm. i Lore 
NuSTAR FPMA norm. (fixed) 1.0 1.0 
NuSTAR FPMB norm. ieee) eae 
Ny [1022 cm~?] 13.4 718 16.4 753 
kT bremss [keV] bee 66 +3 
Nbremss [10~* photons cm~? s~! keV~1] 18008 1.9404 
kT; [keV] Ot. eee 
Nr, [10~* photons cm~* s~? keV~*] 9.5 +3) 93.2 17-9 
Zi[Zo]” 5Ot es 2.2t0% 
kT» [keV] Tee 5.0 te 
Nxr, [10~* photons cm~* s~? keV~*] 8.8 +70 18.0 *$% 
2[Zol* 1.6 22108 
Fe K-a eq. width [eV] 123t? 46+35 
x7/d.0.F. 1.00 (501.6/503) 1.05 (807.1/770) 
Fx (20-40 keV) [1071 ergs cm? s~1]9 7.3 8.0 


Model and footnotes as in Extended Data Table 2, but with the hard emission modelled as a thermal bremsstrahlung. Noremss is the normalization of the high-temperature bremsstrahlung component. 
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Topological valley transport at bilayer graphene 


domain walls 


Long J u'*, Zhiwen Shi!*, Nityan N air, Yinchuan Lv!, Chenhao Jin!, Jairo Velasco Jr!, Claudia Ojeda-Aristizabal'’, Hans A. Bechtel”, 
Michael C. Martin’, Alex Zettl'*"*, James Analytis'** & Feng Wang'*4 


Electron valley, a degree of freedom that is analogous to spin, can 
lead to novel topological phases in bilayer graphene. A tunable 
bandgap can be induced in bilayer graphene by an external electric 
field’°, and such gapped bilayer graphene is predicted to be a topo- 
logical insulating phase protected by no-valley mixing symmetry, 
featuring quantum valley Hall effects and chiral edge states*’. 
Observation of such chiral edge states, however, is challenging 
because inter-valley scattering is induced by atomic-scale defects at 
real bilayer graphene edges”®. Recent theoretical work’ ” has shown 
that domain walls between AB- and BA-stacked bilayer graphene 
can support protected chiral edge states of quantum valley Hall 
insulators. Here we report an experimental observation of ballistic 
(that is, with no scattering of electrons) conducting channels at 
bilayer graphene domain walls. We employ near-field infrared nano- 
metre-scale microscopy (nanoscopy)'*”* to image in situ bilayer 
graphene layer-stacking domain walls on device substrates, and we 
fabricate dual-gated field effect transistors based on the domain 
walls. Unlike single-domain bilayer graphene, which shows gapped 
insulating behaviour under a vertical electrical field, bilayer gra- 
phene domain walls feature one-dimensional valley-polarized con- 
ducting channels with a ballistic length of about 400 nanometres at 
4kelvin. Such topologically protected one-dimensional chiral states 
at bilayer graphene domain walls open up opportunities for explor- 
ing unique topological phases and valley physics in graphene. 

Hexagonal two-dimensional crystals, such as graphene and trans- 
ition-metal dichalcogenides, exhibit a pair of degenerate bands at the K 
and K’ valleys in momentum space. The valley electrons are charac- 
terized by non-trivial Berry curvatures, which give rise to anomalous 
quantum Hall states in graphene’’ as well as valley Hall effects (the 
valley counterpart of spin Hall effects'*'’). Bilayer graphene provides 
an attractive platform from which to explore topological valley phys- 
ics: a tunable semiconductor bandgap can be induced in bilayer 
graphene using a vertical electrical field, and different topological 
phases can exist in gapped bilayer graphene®”"’”'”’. In particular, 
quantum valley Hall insulator states can exist in bilayer graphene (with 
any edge termination other than the armchair type) with a field- 
induced bandgap, where counterpropagating chiral electrons with 
opposite valley index exist at their boundaries*°. Such one-dimen- 
sional chiral states are topologically protected as long as there are no 
inter-valley scatterings generated by atomic-scale defects. 

Domain walls between AB- and BA-stacked bilayer graphene pro- 
vide an attractive place to realize one-dimensional chiral boundary 
states of quantum valley Hall insulators because a smooth domain wall 
preserves the electron valley index in bilayer graphene, unlike valley- 
mixing atomic defects at graphene edges. The integrated Berry curv- 
ature of bilayer graphene valence bands for each valley is characterized 
by a quantized “valley” Chern number’’”’, the sign of which depends 
both on the direction of the vertical electrical field and the layer 


stacking order. Martin et al." first proposed topological domain 
walls between band-inverted bilayer graphene insulators generated 
by opposite vertical electrical fields. Experimental realization of such 
electric-field walls, however, is extremely challenging. An alternative 
approach is to exploit different stacking orders. AB- and BA-stacked 
bilayer graphene under the same vertical electrical field represent two 
distinct topological phases with opposite valley Chern numbers. The 
stacking boundary between AB and BA domains forms a layer stacking 
domain wall (Fig. la). Valley-polarized chiral electrons are pre- 
dicted'*"* to flow along such a domain wall in gapped bilayer graphene, 
where K and K’ valley electrons propagate in opposite directions. 
Recent transmission electron microscopy studies show that smooth 
AB-BA domain walls with a width of several nanometres exist naturally 
in suspended bilayer graphene”, but such suspended transmission 
electron microscopy bilayer graphene samples on through-holes (that 
is, holes that penetrate through a substrate) cannot easily be com- 
bined with bandgap control and transport measurements. Here we 
use a new approach to image in situ AB-BA domain walls in exfo- 
liated (that is, prepared from graphite using tape) bilayer graphene 
on SiO,/Si substrates, and demonstrate topologically protected 
charge and valley transport along such boundaries for the first time. 

Few-layer graphene was mechanically exfoliated on 280-nm-thick 
SiO,/Si substrates, and the layer thickness is determined using optical 
contrast measurements. To identify AB-BA layer stacking domain walls 
in bilayer graphene we used near-field infrared nanoscopy”, as illu- 
strated in Fig. 1b. Infrared light at 6.1 j1m was focused onto the apex ofa 
metal-coated atomic force microscope (AFM) tip with curvature radius 
r= 25nm, and the scattered infrared radiation was collected and mea- 
sured by an HgCdTe detector in the far field. The tip-enhanced infrared 
scattering provides a local probe of the material infrared responses with 
~40 nm spatial resolution. The AB- and BA-stacked bilayer graphene, 
being inversion symmetric to each other, have exactly the same elec- 
tronic bands and infrared absorption. The local band structure of the 
domain wall, however, is strongly modified relative to the bulk and 
exhibits different infrared contrast. Figure lc and d displays AFM 
topography and near-field infrared nanoscopy images of the same 
bilayer graphene on a SiO,/Si substrate. The bilayer graphene topo- 
graphy shows a step height of ~1 nm, consistent with the bilayer thick- 
ness. Within the bilayer graphene flake, the topography is featureless 
with a height fluctuation of less than 0.2 nm. In the near-field infrared 
nanoscopy image, however, distinct features appear in the bilayer gra- 
phene: multiple bright lines extend across the bilayer graphene flake 
and terminate at the edges. These bright lines in the infrared image 
indicate a change of local infrared conductivity in the bilayer graphene. 
No corresponding features can be discerned in the topography image 
(Fig. 1c), which excludes the possibility of surface contaminations or 
graphene folding along these lines. We attribute these local infrared 
response changes to AB-BA domain walls in bilayer graphene. 
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Figure 1 | Imaging AB-BA domain walls in exfoliated bilayer graphene. 

a, Illustration of a domain wall (shaded area) between AB- and BA-stacked 
bilayer graphene domains. The yellow (green) arrow indicates the one- 
dimensional conducting channel at the domain wall from K-valley (K’-valley) 
electrons. b, Illustration of the near-field infrared nanoscopy measurement of 
graphene on a SiO,/Si substrate (pink/grey). Infrared light (yellow) at 6.1 pm 
was focused onto the apex of a metal-coated AFM tip and the backscattered 
infrared radiation was collected and measured by an HgCdTe detector in the far 


Qualitatively, we can approximate bilayer graphene in the domain 
wall region as randomly stacked bilayers, which has stronger local 
infrared response than bulk AB-stacked bilayer graphene and leads 
to the bright lines observed in infrared nanoscopy images (see 
Methods for details). 

To confirm that the bright lines within the bilayer graphene are layer 
stacking domain walls, we systematically examined near-field infrared 
images of exfoliated graphene flakes of different layer thicknesses. 
Figure 2a-c shows infrared nanoscopy images of representative mono- 
layer, bilayer, and trilayer graphene flakes, respectively. The infrared 
images of monolayers, like that in Fig. 2a, always show homogenous 
responses within the flake, although bright features can be observed 
parallel to the edges owing to the excitation of graphene plasmons. The 
absence of any structural domain boundary in monolayers is consist- 
ent with the high quality of the exfoliated samples. In bilayer graphene, 
about 30% of the exfoliated flakes show bright lines across the interior 
region of graphene like those in Fig. 2b, and areas between these lines 
have the same contrast. We identify these bright lines as AB-BA 
domain walls because they are the only one-dimensional structure that 
exists in bilayer but not monolayer graphene. This identification is 
further bolstered by the infrared images of graphene trilayers, such 
as that shown in Fig. 2c. 

Bright lines extending through the graphene flakes are also 
observed, but unlike graphene bilayers, the domains separated by 
the bright lines can have different infrared contrast. This is because 
more domain stacking orders can exist in the trilayer, such as ABA, 
BAB, or ABC stacking. Although ABA and BAB domains are inversion 
symmetric to each other and have the same electronic structure, ABC 
domains have distinct band structures*’, infrared responses, and 
Raman spectra”’. Therefore, a domain wall between ABA- and ABC- 
stacked trilayers will separate two domains having different infrared 
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field. c, AFM topography map of a bilayer graphene sample on a SiO,/Si 
substrate, showing no surface wrinkles or defects. (The small triangular area at 
the bottom right corner corresponds to a graphene monolayer.) d, Near-field 
infrared nanoscopy image of the same bilayer graphene sample as in c. Bright 
line features across the bilayer graphene flakes are observed that are absent in 
the topography image. These bright lines arise from AB-BA domain walls in 
bilayer graphene because the domain walls have different local electronic 
structure and infrared responses. 


contrast in the near-field infrared image, as seen in domains | and II in 
Fig. 2c. In addition, we performed micro-Raman measurements on 
domains I and II, and show the corresponding spectra in Fig. 2d. It is 
apparent that the two-dimensional Raman features in domains I and II 
are different, and they have been shown to correspond to ABC- and 
ABA-stacked trilayer graphene”, respectively. It demonstrates unam- 
biguously that the bright line features in trilayers are layer stacking 
domain walls, and supports our assignment for bilayers. 

After identifying the layer stacking domain walls in bilayer graphene 
on a SiO,/Si substrate, we fabricated dual-gated bilayer field-effect tran- 
sistor devices*’ across the domain walls. As references, we also fabri- 
cated dual-gated devices without domain walls using the same device 
configuration on the same bilayer graphene flakes. Figure 3a displays the 
optical microscopy image of a representative device. The white dashed 
line outlines the bilayer graphene flake, and the green line denotes the 
position of the AB-BA domain wall in the bilayer graphene. Source and 
drain electrodes (electrodes 1-5) composed of 40 nm Au and 0.5 nm Cr 
layers were defined by electron-beam lithography. Electrodes 1 and 2 are 
connected by a layer stacking domain wall, as are electrodes 2 and 3. 
Electrodes 3 and 4 and electrodes 4 and 5, on the other hand, are within a 
single bilayer graphene domain and act as reference devices. On top of 
the bilayer graphene flake, a 35-nm-thick Al,O3; dielectric layer was 
grown using atomic layer deposition. Last, the top gate electrodes of 
width 1 jim (TG1 and TG2) were patterned using electron-beam litho- 
graphy and metal deposition. Figure 3b illustrates the side view of the 
device between electrodes 2 and 4, with a domain wall (green line) 
between electrodes 2 and 3 and single-domain bilayer graphene between 
electrodes 3 and 4. Here the doped silicon substrate acts as a bottom gate. 
By varying the top gate voltage Vi, and the bottom gate voltage Vig, we 
can independently control the electric field above and below the bilayer 
graphene to generate a tunable semiconductor bandgap”. 
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Figure 2 | Infrared nanoscopy images of domains and domain walls in 
exfoliated mono-, bi- and trilayer graphene. a, Infrared image of a graphene 
monolayer. The darker region corresponds to the SiO2/Si substrate. 
Monolayer graphene shows homogeneous contrast across the whole sample 
with no bright lines in the interior area. (The bright lines parallel to the edge 
arise from plasmon excitations.) The same behaviour is observed in all 
exfoliated monolayers. b, Infrared image of a graphene bilayer, which shows 
bright line features across the flake. Such bright lines arise from AB-BA 
domain walls in bilayer graphene and they are observed in ~30% of 
exfoliated bilayer graphene samples. The bulk AB and BA domains have the 


Figure 3c displays the electrical transport at 4 K ina reference bilayer 
graphene device without a domain wall. Each trace shows the gate- 
dependent resistance as a function of V;, at a fixed bottom gate voltage, 
Vbg: Vig Was varied in steps of 10 V to produce the different traces. The 
resistance peak in each trace corresponds to a charge neutral point 
(CNP), at which the Fermi level resides within the semiconductor 
bandgap. CNPs can be realized with different combinations of Vig 
and Vp,, but each with a different vertical electrical field and induced 
semiconductor bandgap in bilayer graphene. The zero-bandgap state is 
realized at Vog” =0V and Veg. = 1.8V, which has the lowest peak 
resistance. The average displacement field* across bilayer graphene is 
described by D = ¥2(D,+D,). Here Dy = ésio, | Vog — Vog”| /dsio,» and 
Dt =€ai,0, | Vig — Vig? | /dai,o,> Where ¢ and dare the dielectric constant 
and thickness of the respective oxide dielectric layers. The bandgap 
increases monotonically with the average vertical displacement field D, 
which results in a larger peak resistance at the CNP. The experimental 
values of the peak resistances are lower than that expected from the 
semiconductor bandgap in ideal bilayer graphene, presumably owing 
to impurities and defects in our devices. Nevertheless, the resistance 
can reach ~90kQ at 4.2 K, similar to that observed in previous dual- 
gated bilayer graphene devices on SiO,/Si substrates*. 

Electrical transport through bilayer graphene with an AB-BA 
domain wall exhibits a different behaviour, as shown in Fig. 3d. 
When the bilayer graphene bandgap is small (purple traces), the 


652 | NATURE | VOL 520 | 30 APRIL 2015 


) ! ! 


2,650 2,750 2,850 


a(cm") 


same electronic structure and infrared contrast. c, Infrared image of a 
graphene trilayer, which exhibits domains with different infrared contrast as 
well as bright lines between domains. Different stacking orders, such as ABA 
and ABC stacking, can lead to different electronic structures and infrared 
contrast in trilayers. d, Raman spectra taken at domains I and II in c; w is the 
Raman shift. The two-dimensional peak of domain I (black curve) and 
domain II (red curve) are characteristic of ABC and ABA stacking, 
respectively. It confirms that the bright and dark trilayer regions in 

c correspond to different layer stacking orders, and that the bright line 
features arise from layer stacking domain walls. 


gate-dependent resistance curves are similar to those in single-domain 
bilayer graphene devices because bulk current flowing through the 
domain dominates. With higher displacement fields and larger bilayer 
graphene bandgaps, the peak resistance in each curve first increases 
and then saturates at ~ 14 kQ (red traces in Fig. 3d), in striking contrast 
to the monotonically increasing resistance to over 80 kQ in the ref- 
erence device without a domain wall (Fig. 3c). Figure 4a plots the 
comparison of bilayer graphene peak resistances in the device with a 
domain wall (red circles) and without a domain wall (black squares) as 
a function of the vertical displacement field D, explicitly showing that 
saturation of the peak resistances occurs at a much lower value, owing 
to the presence of the domain wall. 

These distinct transport behaviours in bilayer graphene with and 
without AB-BA domain walls were observed in all of the bilayer gra- 
phene devices that we studied (see Methods for details.) It is apparent 
that the conductance is always much higher in bilayer graphene 
devices with AB-BA domain walls. In addition, we performed temper- 
ature dependence studies of CNP resistance in bilayer graphene with 
and without domain walls from 1.8 K to 50 K. We observed that the 
resistance of bulk bilayer graphene increases many times at lower 
temperature, characteristic of an insulating phase. The resistance 
across the domain wall, however, behaves like a metallic system with 
much weaker temperature dependence (see Methods for details). Our 
results demonstrate unambiguously that one-dimensional conducting 
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Figure 3 | Dual-gated field-effect transistor devices on bilayer graphene 
with AB-BA domain walls and their electrical transport properties. 

a, Optical micrograph of a dual-gated bilayer device. The white dotted line 
outlines the bilayer graphene flake, and the green line denotes the AB-BA 
domain wall. Source and drain electrodes are labelled from 1 to 5. Electrodes 1 
and 2 are connected by a layer stacking domain wall, as are electrodes 2 and 3. 
Electrodes 3 and 4 and electrodes 4 and 5, on the other hand, are within a single 
bilayer graphene domain and act as reference devices. TG1 and TG2 denote top 
gate electrodes. b, Side view of the dual-gated bilayer device between electrodes 
2 and 4 in a. c, Gate-dependent resistance for bilayer graphene without a 


channels are present at the AB-BA domain walls, and that they dom- 
inate the electrical transport when bulk bilayer graphene domains 
become insulating. 

To examine the conducting channels through the bilayer graphene 
domain walls, we systematically investigated bilayer graphene devices 
with varying channel length (defined by the top gate width) along the 
same bilayer graphene domain wall. Figure 4b displays the measured 
conductance of the two sets of bilayer graphene domain wall channels 
with lengths of 200 nm, 400 nm, 600 nm and 800 nm, respectively and 
another batch of 1,000-nm-long bilayer graphene domain wall channels. 
All the conductance values are of the order of the quantum conductance 
(2e7/h, where ¢ is the unit charge and h is Planck’s constant), and one of 
the shortest channels exhibits a conductance approaching 4e”/h. 

The observed one-dimensional conducting channels correspond to 
the chiral edge modes at AB-BA domain walls recently predicted by 
theory'’*. Figure 4c shows the electronic band structure around the 
K and K’ points for bilayer graphene with an AB-BA domain wall. In 
gapped bilayer graphene, the AB- and BA-stacked domains represent 
two different quantum valley Hall insulators. Consequently, a domain 
wall separating insulating AB and BA domains will confine a pair of one- 
dimensional conducting channels along the boundary for each valley. 
The conducting electrons in different valleys have opposite energy dis- 
persions, and therefore propagate with opposite velocities. Such counter- 
propagating, valley-projected chiral electrons along the AB-BA domain 
walls are illustrated in Fig. 4d in real space. These conducting chiral 
modes are topologically protected within a single valley. Inter-valley 
scatterings can potentially mix the forward- and backward-moving elec- 
trons and eliminate the chiral modes, analogous to magnetic impurities 


domain boundary (electrodes 3 and 4). The backgate voltage Vpg is varied from 
0 V to 140 V with a step of 10 V (colours here and in d indicate different curves), 
and the top gate voltage V;, is swept continuously. The resistance peak in each 
trace corresponds to a CNP. The resistance at the CNP increases continuously 
(dashed grey line) to over 80 kQ at high V,,, owing to the opening of a bandgap. 
d, Gate-dependent resistance across bilayer graphene with a domain boundary 
(electrodes 2 and 3). The resistance at the CNP first increases and then saturates 
at ~14kQ, although the bilayer graphene bandgap keeps increasing with 
increasing Vp,. This much lower resistance signifies the presence of a one- 
dimensional conducting channel at the AB-BA domain wall. 


that flip spins and eliminate chiral modes in quantum spin Hall insula- 
tors. However, at atomic-defect-free AB-BA domain walls where the 
crystal lattice is smoothly varying, such inter-valley scattering will be 
absent because it requires a large momentum change of 417/ (3/3) Aco 
where a... is the carbon-carbon bond length of 1.42 A 

We can quantitatively compare the observed one-dimensional con- 
ductance along the AB-BA domain wall with theoretical predictions. 
As illustrated in Fig. 4d, two conducting chiral modes are present at the 
domain wall for a given current flow direction. In addition, each chiral 
mode is doubly degenerate owing to electron spin. Therefore, four one- 
dimensional conducting channels exist at the domain wall, giving an 
ideal conductance of 4e’/h, that is, two conductance quanta. 
Experimentally, we observed conductance of the order of the conduc- 
tance quanta in all devices, and one 200-nm-long domain wall channel 
exhibits conductance close to the value of 4e”/h (Fig. 4b). We found 
that overall the one-dimensional channel conductance becomes lower 
with increasing channel length. The observed length-dependent con- 
ductance through bilayer graphene domain walls can be described by 
the Landauer-Bittiker formula” with a finite electron mean free path: 


-1 
G= (R +Ro (1 + E)) , where L is the channel length, Ro = h/4e?, 


the contact resistance R, = 0, and the mean free path Lp = 420nm 
(solid line in Fig. 4b). The electron mean free path of 420 nm can arise 
from a finite inter-valley scattering along the bilayer graphene domain 
walls, and previous transport studies have reported a comparable 
inter-valley scattering length of 300-800 nm in bilayer graphene on 
SiO,/Si substrates”. Nevertheless, this electron mean free path along 
the bilayer graphene domain wall is orders of magnitude longer than 
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Figure 4 | Conductivity of topologically protected chiral modes at AB-BA 
domain walls. a, CNP resistances as a function of displacement field D for 
bilayer graphene with no domain wall (NDW, black squares) and for bilayer 
graphene with a domain wall (DW, red circles) obtained from Fig. 3. The 
saturation of the resistance at a much lower value at high D originates from one- 
dimensional conducting channels at the domain wall. b, Length dependence of 
the bilayer graphene domain wall conductance. Green and blue symbols 
represent two sets of devices, each fabricated on one single domain wall in the 
same bilayer graphene flake. Red symbols represent another batch of 1,000-nm- 
long domain wall channels. All devices exhibit conductance of the order of the 
quantum conductance, with lower conductance in longer channels and a 
maximum conductance approaching 4e?/h. This length-dependent one- 


that in the bilayer graphene domain, which has a mobility of 
~2,000 cm? V's! anda mean free path of only ~8 nm in our sam- 
ples. In general, scattering and localization effects tend to be more 
important in one dimension. The much longer mean free path at the 
AB-BA domain walls therefore highlights the topological protection of 
chiral electron modes, where the electron velocity-valley locking for- 
bids intra-valley backscattering. Because inter-valley scattering along 
the smooth domain walls is negligible, the chiral valley channels at AB- 
BA domain walls are very robust even for bilayer graphene devices on 
SiO2/Si substrates. We envision that encapsulation in boron nitride 
layers” could potentially lead to chiral electrons at AB-BA domain 
walls with very long coherent lengths. 

Our observation of topologically protected one-dimensional con- 
ducting channels at bilayer graphene domain walls opens up exciting 
opportunities to explore tunable topological phases and valley physics 
in graphene. Owing to the velocity-valley locking, the one-dimensional 
chiral modes at the AB-BA domain walls lead not only to ballistic 
charge transport, but also to ballistic valley transport. Such chiral 
modes can enable novel ways to control the valley degree of freedom 
for valleytronic devices based on bilayer graphene. 
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Mechanism of the infrared contrast for domain walls. Our near-field infrared 
nanoscopy probes the local optical conductivity at infrared frequencies with ~40 nm 
spatial resolution. The infrared contrast of the domain wall arises from its different 
electronic structure and infrared conductivity from the AB-stacked bilayer graphene. 
An exact description of the local band structure of a domain wall is beyond the scope 
of this work. However, the domain wall region is characterized by shifted top and 
bottom layers, which can be approximated by two randomly stacked monolayers of 
graphene. Such randomly stacked graphene has electronic structure similar to that of 
two separated monolayers close to the Dirac point”, and its infrared conductivity can 
be approximated by 20 monolayer. The AB-stacked bilayer graphene, on the other hand, 
has an infrared conductivity of Opiayer- The optical conductivities of both monolayer 
and AB-stacked bilayer graphene have been well established in the literature**. At the 
infrared wavelength of 6.1 jum, the value of 2¢ monolayer is about 20% higher than that 
of Opitayer, Which can qualitatively explain the bright infrared contrast of the domain 
walls observed in our studies. 

Electrical transport data of four extra bilayer graphene devices. We investi- 
gated five bilayer graphene device pairs with a fixed channel length of 1 jim. In 
Extended Data Fig. 1, we show the complete set of data taken from the four pairs of 
bilayer graphene devices other than that included in Fig. 3c and d. Each pair of 
bilayer graphene devices includes one with AB-BA stacking domain walls and one 
reference device with no domain walls on the same graphene flake. Extended Data 
Fig. la-d displays resistances of corresponding bilayer graphene devices with 
domain walls, including three single-domain-wall devices (Extended Data Fig. 
la-c) and one double-domain-wall device (Extended Data Fig. 1d). All single- 
domain-wall devices show resistances saturating at 14kQ with increasing D, and 
the double-domain-wall device has a resistance saturating around 8 kQ. For com- 
parison, Extended Data Fig. le-h displays the gate-dependent resistance for four 
bilayer graphene devices without boundaries. These devices show continuously 
increasing CNP resistance to well above 14 kQ (8 kQ in Extended Data Fig. 1h), as 


indicated by the dashed line at high Vpg. All the data show that one-dimensional 
conducting channels are present at AB-BA domain walls, and that they dominate 
electrical transport in gapped bilayer graphene. 

Length dependence of electrical transport along AB-BA domain walls. We 
investigated two sets of dual-gated bilayer graphene domain wall devices of vary- 
ing channel lengths (defined by the top gate width). Each set of devices are 
fabricated on one single domain wall in the same bilayer graphene flake, and 
includes four channels with lengths of 200nm, 400nm, 600nm and 800nm, 
respectively. The complete transport data of these bilayer graphene domain wall 
devices are shown in Extended Data Fig. 2. 

Temperature dependence of electrical transport of AB-BA domain walls. We 
investigated the temperature dependence of electrical transport both through 
AB-stacked bilayer graphene and through AB-BA domain walls. Extended 
Data Fig. 3 displays representative data for CNP resistance at Vpg = — 140 V 
for a bilayer graphene device without a domain wall (black line) and a device 
with a domain wall (red line). For the bilayer graphene bulk device without a 
domain wall, the CNP resistance increases by over 300 kQ (2.5 times) as the 
temperature is varied from 50 K to 1.8K. This large increase of resistance at 
lower temperatures is characteristic of insulating behaviour. In comparison, 
the CNP resistance of device with a domain wall (red line) has much weaker 
temperature dependence: the resistance increase is only 5kQ (~50%), and 
part of the increase is due to the contribution from bulk bilayer graphene 
channel in parallel with the one-dimensional domain wall channel. The weak 
temperature dependence of conductance along the domain wall is character- 
istic of a metallic system. 
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Extended Data Figure 1 | Electrical transport of dual-gated bilayer 
graphene devices with (left) and without (right) AB-BA domain walls. 
a-d, Gate-dependent resistance R for three bilayer graphene devices with one 
domain wall (a—c) and one with two domain walls (d) at 4.2 K. The backgate 
voltage Vpg is varied from 0 V to 140 V with a step of 10 V, and the top gate 
voltage Vi, is swept continuously. The resistance peak in each trace corresponds 
to a CNP. The resistance at the CNP first increases and then saturates at 
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~14kQ in a-c (~8 kQ in d). The dashed lines indicate the saturated 
conductance values. e-h, Gate-dependent resistance of four reference bilayer 
graphene devices without domain walls. The resistance at the CNP increases 
continuously at high V,, owing to the opening of a bandgap, and reaches values 
much higher than that of corresponding devices with bilayer graphene domain 
walls (indicated by the dashed line). 
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Extended Data Figure 2 | Length dependence of electrical transport of AB- 
BA domain walls. Data in left panels are from one graphene flake, data in right 
panels are from another. a-d, Gate-dependent resistance for four bilayer 
graphene devices fabricated on a long domain wall with channel lengths of 
200 nm, 400 nm, 600 nm and 800 nm (defined by the top gate width), 
respectively. e-h, Gate-dependent resistance for four bilayer graphene devices 
fabricated on another long domain wall with channel lengths of 200 nm, 


SoS 


400 nm, 600 nm and 800 nm respectively. The backgate voltage Vp, is varied 
from 0 to — 140 V with a step of 10 V, and the top gate voltage V;, is swept 
continuously. The resistance peak in each trace corresponds to a CNP. The 
resistance at the CNP first increases as the bandgap is opened in the bulk and 
then saturates at high V,,, where the electrical transport is dominated by one- 
dimensional channels along the domain wall. 
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Extended Data Figure 3 | Temperature-dependent electrical transport (Vig = —140 V) of a bilayer graphene device with domain wall (red line, the 
through bulk bilayer graphene and through the AB-BA domain wall. The same device as in Extended Data Fig. 2c) increases by only 5 kQ (~50%) at the 
CNP resistance (Vig = — 140 V) of a bilayer graphene device without adomain same temperature. The much weaker temperature dependence of the domain 
wall (NDW, black line) increases by over 300 kQ (~2.5 times) as the wall resistance arises from a metallic one-dimensional conducting channel. 


temperature is decreased from 50 K to 1.8 K. In contrast, the CNP resistance 
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High-mobility three-atom-thick semiconducting 
films with wafer-scale homogeneity 
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The large-scale growth of semiconducting thin films forms the basis 
of modern electronics and optoelectronics. A decrease in film thick- 
ness to the ultimate limit of the atomic, sub-nanometre length scale, 
a difficult limit for traditional semiconductors (such as Si and 
GaAs), would bring wide benefits for applications in ultrathin and 
flexible electronics, photovoltaics and display technology’*. For 
this, transition-metal dichalcogenides (TMDs), which can form 
stable three-atom-thick monolayers’, provide ideal semiconducting 
materials with high electrical carrier mobility*"°, and their large- 
scale growth on insulating substrates would enable the batch 
fabrication of atomically thin high-performance transistors and 
photodetectors on a technologically relevant scale without film 
transfer. In addition, their unique electronic band structures provide 
novel ways of enhancing the functionalities of such devices, includ- 
ing the large excitonic effect", bandgap modulation”, indirect-to- 
direct bandgap transition’, piezoelectricity'* and valleytronics”. 
However, the large-scale growth of monolayer TMD films with 
spatial homogeneity and high electrical performance remains an 
unsolved challenge. Here we report the preparation of high-mobility 
4-inch wafer-scale films of monolayer molybdenum disulphide 
(MoS) and tungsten disulphide, grown directly on insulating SiO, 
substrates, with excellent spatial homogeneity over the entire films. 
They are grown with a newly developed, metal-organic chemical 
vapour deposition technique, and show high electrical performance, 
including an electron mobility of 30cm? V~‘s_' at room temper- 
ature and 114cm”V~'s~' at 90K for MoS,, with little dependence 
on position or channel length. With the use of these films we 
successfully demonstrate the wafer-scale batch fabrication of high- 
performance monolayer MoS, field-effect transistors with a 99% 
device yield and the multi-level fabrication of vertically stacked tran- 
sistor devices for three-dimensional circuitry. Our work is a step 
towards the realization of atomically thin integrated circuitry. 
Existing growth methods for large-scale monolayer TMDs have so far 
produced materials with limited spatial uniformity and electrical per- 
formance. For instance, the sulphurization of metal or metal compounds 
only provides control over the average layer number, producing spatially 
inhomogeneous mixtures of monolayer, multi-layer and no-growth 
regions'®'’, Although chemical vapour deposition (CVD) based on 
solid-phase precursors (such as MoO3, MoCl; or WO3)'*”* has shown 
better thickness control on a large scale, the electrical performance of the 
resulting material, which is often reported from a small number of devices 
in selected areas, fails to show spatially uniform high carrier mobility. 
Here we report the growth of semiconducting monolayer films of 
MoS, and tungsten disulphide (WS,) on silicon oxide on a 4-inch 
wafer scale, with both excellent electrical performance and structural 
continuity, maintained uniformly over the entire films. Figure 1 pre- 
sents our continuous TMD monolayer films and shows their wafer- 
scale homogeneity and intrinsic optical properties. The colour photos 
of MoS, (Fig. la; greenish yellow) and WS, (Fig. 1b; yellow) films 


grown on a transparent 4-inch fused silica wafer show that the TMD 
grown region (right half) is uniform over the whole substrate and 
clearly distinguishable from the bare silica substrate (left half). The 
optical absorption, photoluminescence and Raman spectra measured 
from our films show characteristics unique to monolayer MoS, and 
WS,, respectively (Fig. 1d-f). All of these measured spectra have the 
same peak positions as in exfoliated monolayer samples (denoted by 
diamonds)’***°, regardless of the location of the measurements 
within our films (Supplementary Fig. 1). The X-ray photoelectron 
spectra taken from our monolayer MoS, film show almost identical 
features to those of bulk single crystal with a low level of defects, further 
confirming the precise chemical composition and the high quality of 
our MoS, film (Supplementary Fig. 2). 

Figure 1c shows a photo of a MoS, film grown on a 4-inch SiO,/Si 
wafer. The monolayer film was patterned using standard photolitho- 
graphy and oxygen plasma etching to form MoS,-covered squares 
(dark, 6 mm wide) with an array of 3 um holes. An enlarged, normal- 
ized optical reflection image (Fig. 1h) displays a homogeneous 
reflection contrast for the entire MoS -covered region, confirming uni- 
form monolayer growth everywhere with no gaps. In addition, Fig. 1g 
shows a scanning electron microscope (SEM) image of an array of fully 
suspended monolayer MoS, membranes (2 |1m in diameter) fabricated 
by transferring our metal-organic chemical vapour deposition 
(MOCVD)-grown film onto a SiN grid with holes. Its high fabrication 
yield (>99.5%) suggests mechanical strength and continuity of the film. 
The widefield photoluminescence images of these films (insets to 
Fig. 1g, h) show strong, spatially uniform photoluminescence signals, 
further confirming that they are continuous monolayer MoS,, with its 
high quality maintained even after patterning or transfer. The same 
spatial uniformity was seen in the optical reflection and photolumines- 
cence images of a monolayer WS, film that was similarly grown and 
patterned (Fig. li). Together, the data in Fig. 1 confirm that our MoS, 
and WS, films are continuous monolayers, spatially uniform over the 
entire 4-inch growth substrates with intrinsic optical properties. Below, 
using MoS, as the main example, we discuss the growth (Fig. 2) and the 
excellent electrical properties (Fig. 3) of these MOCVD-grown films. 

Figure 2a schematically explains our MOCVD growth, where we 
only use gas-phase precursors of Mo(CO).¢, W(CO)¢, (C2Hs)2S and 
Hz, all diluted in argon as a carrier gas (see Supplementary Methods). 
The concentration of each reactant can be precisely controlled during 
the entire growth time by regulating the partial pressure (Px) of each 
reactant X. Thus our setup offers an ideal environment for maximizing 
the areal coverage of the monolayer and for engineering the film 
structure by controlling the nucleation density and intergrain stitch- 
ing. Figure 2 summarizes our key findings. 

First, our MoS, film is grown in the layer-by-layer growth mode, 
which is ideal for uniform layer control over the large scale. Figure 2c 
plots the areal coverage of monolayer (6,;) and multilayer (0=21; 
mostly bilayer) regions measured from our MoS, grown on SiO,/Si; 
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Figure 1 | Wafer-scale monolayer TMD films. a, b, Photographs of 
monolayer MoS, (a) and WS, (b) films grown on 4-inch fused silica substrates, 
with diagrams of their respective atomic structures. The left halves show 

the bare fused silica substrate for comparison. c, Photograph of patterned 
monolayer MoS, film on a 4-inch SiO2/Si wafer (the darker areas are covered 
by MoS,). d, Optical absorption spectra of MOCVD-grown monolayer MoS 
(red line) and WS, (orange line) films in the photon energy range from 1.6 to 
2.7 eV. e, Raman spectra of as-grown monolayer MoS, and WS,, normalized 
to the silicon peak intensity. f, Normalized photoluminescence spectra of 
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Fig. 2b shows optical images at different growth times, revealing the 
initial nucleation on the SiO, surface (t = 0.5t9), subsequent mono- 
layer growth near (0.8¢)) and at the maximum monolayer coverage 
(to), followed by nucleation mainly at grain boundaries (1.2t9) and 
bilayer growth (2t9). We observed no nucleation of a second layer 
while the first layer was forming (0=2, ~ 0 when t< to), producing 
an optimal growth time fo near full monolayer coverage (0), ~ 1). 
Additional photoluminescence and electron microscope images taken 
after different growth times further suggested that the edge attachment 
was the main mechanism for the monolayer growth after nucleation 
and that the neighbouring monolayer grains were uniformly con- 
nected by tilt grain boundaries with enhanced photoluminescence’”® 
at t = ty (see Supplementary Notes and Supplementary Figs 3 and 4). 
The standard thin-film growth model’ suggests that this growth mode 
is effective below a certain deposition rate of the growth species, above 
which it suggests a different mode that forms thicker islands. Indeed, 
the layer-by-layer growth of MoS, film was observed only when we 
applied a low partial pressure (Py. ~ 10 * Torr in Fig. 2b, c) of Mo 
vapour (produced by the thermal decomposition of Mo(CO)<,; see 
Supplementary Notes and Supplementary Fig. 5) in the presence of 
excess (C,H;)2S. In contrast, the growth at a higher P\y, was no longer 
in the layer-by-layer growth mode, instead simultaneously producing 
a mixture of monolayer, multilayer and no-growth regions 
(Supplementary Fig. 6). For the uniform monolayer growth over a 
large substrate, it is thus important to maintain a low P,,, constantly 
over the entire growth region and over time, the key technical capabil- 
ity provided by our MOCVD setup (see Supplementary Fig. 7 for the 
spatially homogeneous monolayer nucleation on a multi-inch scale). 

Second, the grain structure of our MoS, film, including the average 
grain size and the intergrain connection, depends sensitively on the 
concentrations of H>, (C,H;)2S as well as residual water. As a repres- 
entative example, Fig. 2d shows the two main effects of H2, whose 
presence is necessary for removing carbonaceous species generated 
during the MOCVD growth: the average grain size increases from 
hundreds of nanometres to more than 10 jum with decreasing H, flow, 
and the MoS, grains grown under higher H, flow (Fig. 2d, right image) 


2 Fused silica 
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PLat 1.9 eV PL at 2.0 eV 
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as-grown monolayer MoS, and WS). The peak positions in d-f are consistent 
with those seen from exfoliated samples (diamonds). g, SEM image and 
photoluminescence (PL) image (bottom inset, at 1.9 eV) of monolayer (ML) 
MoS, membranes suspended over a SiN TEM grid with 2 um holes (a 
diagram of the suspended film is shown in the top inset). Scale bar, 10 um. 
h, i, Optical images (normalized to the bare substrate region) of patterned 
monolayer MoS, (h) and WS; (i) on SiO,, taken from the wafer-scale 
patterned films. The insets show photoluminescence images at 1.9 eV (h) and 
2.0 eV (i). Scale bars, 10 um. 


have mostly perfect triangular shapes without merging with neighbour- 
ing grains, a trend that disappears with lower H) flow (left and middle 
images). These observations are consistent with the Hj-induced decom- 
position of (C,Hs)2S (increasing nucleation due to hydrogenolysis)”* 
and the etching of the MoS, (preventing intergrain connection)” as 
reported previously. (For further discussion on the effects of (C2Hs)2S 
and water, see Supplementary Notes and Supplementary Fig. 6.) To 
grow continuous monolayer MoS, with a large grain size and high- 
quality intergrain stitching, we thus flowed optimal amounts of H, 
and (C,Hs)2S and dehydrated the growth environment. 

The darkfield transmission electron microscope (DF-TEM) and 
annular darkfield scanning TEM (ADF-STEM) images shown in 
Fig. 2e, f confirm the structural continuity of our MoS, film grown 
under those conditions on the nanometre and atomic length scales. 
The DF-TEM image shows a continuous polycrystalline monolayer 
film with no visible gaps and a bilayer area of less than 0.5%. Further 
analysis of the DF-TEM and electron diffraction data (see 
Supplementary Fig. 8) confirms a uniform angular distribution of 
crystal orientations with no preferred intergrain tilt angle for grain 
boundaries. The ADF-STEM data (Fig. 2f more images are shown 
in Supplementary Fig. 9) further confirm that adjacent grains are likely 
to be connected by a high-quality lateral connection with structures 
similar to those seen in previous reports'*”’. The MoS, films shown in 
Fig. 1 as well as those whose electrical properties we show in Figs 3 
and 4 were grown under the conditions described in Supplementary 
Methods, producing an average grain size of ~1 um (see Fig. 2b, e). 
Almost identical growth parameters with Pw ~ 10-* Torr produced 
monolayer WS, films as shown in Fig. 1b, i, indicating the same layer- 
by-layer growth for WS, with a similar fo. 

The electrical properties of our monolayer MoS, films have two 
important characteristics: the spatial uniformity over a large scale 
and excellent transport properties similar to those seen in exfoliated 
samples. All our electrical measurements in Figs 3 and 4 (except those 
in Fig. 3c) were performed at room temperature. Figure 3a shows a plot 
of sheet conductance (a) against backgate voltage (Vgc) measured 
from a monolayer MoS, field-effect transistor (FET; optical image 
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Figure 2 | MOCVD growth of continuous monolayer MoS, film. a, Diagram 
of our MOCVD growth setup. Precursors were introduced to the growth setup 
with individual mass flow controllers (MFCs). Red, Mo or W atom; yellow, 

S atom; white, carbonyl or ethyl ligand. b, Optical images of MOCVD-grown 
MoS, at the indicated growth times, where fy was the optimal growth time 
for full monolayer coverage. Scale bar, 10 um. c, Coverage ratio for monolayer 
(9,1, green) and multi-layer (0.1, purple) regions as a function of growth time. 
d, Grain size variation of monolayer MoS, depending on the hydrogen flow 
rate; from left to right, 5 standard cm? min”! (sccm) (SEM image shown), 

20 sccm (SEM) and 200 sccm (TEM). e, False-colour DF-TEM image showing a 
continuous monolayer MoS, film. Scale bar, 1 jm. f, ADF-STEM image of a 
laterally stitched grain boundary in a monolayer MoS, film, with red and yellow 
dots representing the Mo and S atoms, respectively. Scale bar, 1 nm. 


shown in the inset) with multiple electrodes for the four-probe mea- 
surements (except for a channel length L of 34 |m). It includes several 
curves for different L ranging between 1.6 and 34 jm (shifted from the 
bottom curve for clarity), all of which show nearly identical beha- 
viours, including the n-type conductance, carrier concentration 
(~4X 10cm ? at Vag = 0V) and high field-effect mobility (pp). 
Figure 3b further plots j¢pp measured from five such devices, fabricated 
at random locations and separated by up to 3.3 mm on a single chip. 
All the devices show similar upp near 30cm” V~'s~", independent of L 
and device location, with similarly uniform o 4-Vgg curves (shown in 
Supplementary Fig. 10), suggesting the spatial homogeneity of the 
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electrical properties of the MoS, film at length scales ranging from 
micrometres to millimetres. 

The distribution of ju; of our devices is compared with the results of 
multiple devices from two previous reports, each measured from indi- 
vidual grains of exfoliated’ or CVD-grown*’ MoS, samples. We find 
that jigz measured from our MOCVD film is similar to the median jpg 
(denoted by a star) of exfoliated samples (and several times higher than 
the CVD results) and has a much narrower distribution. In addition, the 
temperature dependence of /tpp (Fig. 3c) measured from the same device 
in Fig. 3a shows higher upp at lower temperatures (92 cm?V's? at 
100 K) and intrinsic, phonon-limited electron transport similar to the 
behaviours previously observed in exfoliated samples (data from ref. 6 
shown in Fig. 3c) but different from those observed from a CVD sample 
with stronger effects from defects”. Specifically, our data show that the 
temperature dependence of mobility follows a power law of jig: ~ T ” 
with exponent y = 1.6 for temperatures between 150 and 300 K, close to 
the value (1.69) predicted by theory” and consistent with results from 
previous experiments (average value ranging between 0.6 and 1.7)°° for 
a similar temperature range. Finally, Fig. 3d shows a high-performance 
MoS; FET fabricated with an individual top-gate electrode (Va). It has 
a high on/off conductance ratio (~ 10°), current saturation at relatively 
low bias Vsp (lower inset to Fig. 3d), high field-effect mobility 
(~29cm?V~'s~') and large transconductance (~2 uS pm™'), all of 
which are comparable to the best reported results**. We note that our 
devices studied in Fig. 3a—d were fabricated at random locations using a 
polycrystalline monolayer MoS, film, unlike the devices with single- 
grain samples used for comparison. In addition, the electrical properties 
measured from a separate monolayer MoS, film with a larger average 
grain size of 3 um (instead of 1 1m in Fig. 3) have almost identical 
characteristics, including the channel-length independence of jigg and 
the phonon-limited transport at T > 150 K (see Supplementary Fig. 11; 
with the low-temperature mobility as high as 114cm? V_'s~' at 90K). 
Taken together, our data confirm the spatial uniformity and high elec- 
trical performance of our MoS, FETs independent of the average grain 
size, which suggests that the intergrain boundaries in our film do not 
significantly degrade their electrical transport properties. This is prob- 
ably due to the formation of well-stitched intergrain boundaries with a 
low level of defects, an explanation also supported by the ADF-STEM 
(Fig. 2f) and X-ray photoelectron spectrum data (Supplementary Fig. 2) 
discussed above. Our data therefore lead us to conclude that our opti- 
mized MOCVD growth provides an electrically homogeneous mono- 
layer MoS, film. Moreover, we successfully fabricated and measured 60 
FETs by using a monolayer WS, film. Even though the growth of 
monolayer WS, was not carefully optimized, these devices showed 
excellent electrical properties, with their ppp as high as 18 cm ae is 
at room temperature (Fig. 3e) and a median jipz close to 5 cm avis! 
In addition, the WS, device in Fig. 3e showed a high on/off ratio of “108 
and the current saturation behaviour (inset to Fig. 3e) as in our MoS, 
devices. (See Supplementary Fig. 12 for data from additional monolayer 
WS, FET devices). 

The structural and electrical uniformity of our MoS, film enables 
the wafer-scale batch fabrication of high performance FETs as demon- 
strated in Fig. 3f, g. Figure 3f shows a photo of 8,100 MoS, FETs witha 
global back gate, which were fabricated on a 4-inch SiO,/Si wafer with 
a standard photolithography process. The middle and bottom insets 
to Figure 3f show colour-scale maps of o> measured from 100 MoS, 
FETs in one square region at Vgg = +50 V and —5OV, respectively; 
the top inset to Fig. 3f shows an enlarged optical image of the devices. 
We observed an almost perfect device yield of 99%; only two out of 200 
FETs that we characterized (including data from an adjacent region) 
did not conduct. Our data also confirm the spatially uniform n-type 
transistor operation (larger 0, for positive Vgc) with similar Vgg 
dependence for all our devices and high on-state device conductance. 
We further observed similarly uniform Vgq dependence from FET 
devices fabricated using monolayer MoS, films with different average 
grain sizes, as characterized by the histograms of the threshold voltages 
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Figure 3 | Electrical characterization and batch fabrication of monolayer 
TMD FETs. a, Gate-dependent sheet conductance (a4) of monolayer MoS, 
FETs measured at different channel lengths L (curves displaced from the 
bottom for clarity). Inset: optical image of the device; scale bar, 10 |1m. b, Field- 
effect mobility (¢::) measured from five MoS, FETs (ellipses) fabricated at 
random locations with different L. Data from previous results for CVD-grown 
samples” (squares) and exfoliated samples’ (triangles) are shown for 
comparison (purple stars indicate their medians). c, Temperature dependence 
of {tg measured from the device in Fig. 3a at L = 1.6 um (orange) and 8.6 1m 
(red), and from a previous report on exfoliated samples® (grey), both showing 
phonon-limited intrinsic transport. d, Top gate (V;q)-dependent o, for 
dual-gate monolayer MoS, FET (device shown in the upper inset). Lower inset: 


(Supplementary Fig. 13). Similarly, we fabricated 100 individually 
addressable dual-gate MoS, FETs (similar to the device in Fig. 3d) on 
another wafer piece. The histogram of the on-state 0g (Vrg =5V; 
median carrier concentration ~7 X 10'* cm~’) and off-state 4 
(Vig = -5 V) collected from all such FETs (Fig. 3g) shows strong peaks 
above 10 °S and near 10 ''S, respectively, confirming a uniform 
conductance switching behaviour with high on-state ¢4 (>10uS) 
and on-off ratio (~10°). In addition, most of these batch-fabricated 
FETs had a high ppg (>10cm? V_'s~'; see Supplementary Fig. 14). 

The data presented in Figs 1-3 confirm the structural and electrical 
uniformity of the wafer-scale monolayer MoS, film grown by our 
MOCVD method. This makes our film compatible with batch device 
fabrication processes on a technologically relevant scale. Moreover, 
because SiO, provides a substrate for its growth, one can produce 
high-quality monolayer films on a variety of substrates by depositing 
SiO, before the growth. This versatility would permit the fabrication of 
high-performance FETs directly on non-conventional substrates, such 
as metal and thermally stable plastic. In addition, one can integrate 
multiple layers of MoS, devices by repeating the TMD film growth, 
device fabrication and SiO, deposition, which could enable novel 
three-dimensional circuitry. 

In Fig. 4 we demonstrate this unique potential by producing 
multi-stacked monolayer MoS, films as well as electronic devices 
fabricated at different vertical levels. Figure 4a shows diagrams and 
photographs of three substrates each with single, double or triple 
monolayer MoS, films grown at different levels. The first (bottom) 
monolayer film was grown on a fused silica substrate; the additional 
layers were grown on SiO, (100 nm thick) deposited on the prev- 
iously grown MoS, monolayer by using plasma-enhanced CVD. The 
colour of the substrate, which remains uniform for each substrate, 
becomes darker as the number of layers increases. Their absorption 
spectra, shown in Fig. 4b, present almost identical absorption at all 
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Vrg-dependent Ispn—Vsp curves showing current saturation and ohmic 
electrode contact. Scale bar, 10 ttm. e, Gate-dependent o, of a monolayer WS, 
FET showing Lire = 18 cm? V_!s_ 1. Inset: Vrg-dependent Isp—Vgp curves 
showing current saturation and ohmic electrode contact. f, Batch-fabricated 
8,100 MoS, FET devices on a 4-inch SiO,/Si wafer. Top inset: enlarged image of 
one square containing 100 devices. Middle and bottom insets: corresponding 
colour maps of o 5 at gate bias Vag = 50 V and —50V, respectively, with the 
black block in the middle inset representing the only non-conducting device. 
g, Histogram of on-state and off-state a, of 100 dual-gate FETs showing a 
median on-off ratio of 10° and a high on-state conductivity. Dark blue, 

Vic = —5 V; pale blue, V>g = +5 V. All measurements were performed at 
room temperatures except those in c. 


measured wavelengths, once normalized by the number of stacks 
grown (see the inset), suggesting little degradation of the optical 
properties of the monolayer MoS, films after subsequent oxide 
deposition and MoS, growth. 

Figure 4c shows successive diagrams of our multi-stacked device 
fabrication process: growth of the first MoS, monolayer on a SiO2/ 
Si wafer, FET fabrication, deposition of SiO, (thickness 500 nm; see 
Supplementary Methods for details), and growth of the second 
MoS, monolayer and FET fabrication. A false-colour SEM image 
in Fig. 4d shows an array of MoS, FETs successfully fabricated with 
this process. It includes functioning MoS, FETs located at two 
different vertical levels whose conductance can be simultaneously 
modulated with a global back gate. The Isp-Vsp curves measured 
from two FETs, adjacent both laterally and vertically (see the inset 
to Fig. 4d), are shown in Fig. 4e. Both devices show a Vgg-depend- 
ent conductance change (notice the smaller change for the second 
layer) with an on-state 05 of 2.5 pS (first layer) and 1.5 uS (second 
layer), respectively. Furthermore, we measured similar jpg values 
(11.5 and 8.8cm?V~‘s~*) from the two devices (Supplementary 
Fig. 15). The two monolayer MoS, films were grown on SiO; 
substrates prepared differently, and the first-layer device had gone 
through additional steps, including the second MoS, growth. Our 
data in Fig. 4 thus confirm the compatibility of our MOCVD-grown 
MoS, films with conventional thin-film deposition and multi- 
stacking, which could be used to develop a three-dimensional 
device architecture based on TMD. 

Our high-mobility monolayer TMD films can be used immedi- 
ately for the batch fabrication of TMD-based integrated circuitry 
consisting of FETs, photodetectors and light emitting diodes, on 
a technologically relevant multi-inch wafer scale. In addition, 
because our MOCVD growth is controlled by the kinetics of 
precursor supply rather than specific precursor—substrate chemistry 
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Figure 4 | Multi-stacking of MoS,/SiO, structure. a, Diagrams (left) and 
optical images (right) of single, double and triple stacking of monolayer 
MoS,/SiO>. b, Optical absorption spectra for single, double and triple stacks, 
respectively (normalized spectra shown in the inset). c, Diagram for fabrication 
of MoS, device/SiO2 stacking by alternating MOCVD growth, device 
fabrication with photolithography, and SiO, deposition. See the text for details. 
d, False-colour SEM image of MoS, FET arrays on first (bottom) and second 
(top) layers (the inset shows an enlarged image of a pair of devices in the same 
relative positions as in the main panel; scale bar, 50 jim). e, Isp—Vsp curves 
measured from two neighbouring devices on the first (left) and second (right) 
layers, both showing n-type conductance switching. For the first layer, 

Vac = 50 V (green) and —50 V (grey); for the second layer, Vgg = 100 V 
(yellow) and —100 V (grey). 


(an example of the latter would be the different graphene growth 
modes on copper and nickel), its use is not limited to the TMD- 
substrate combinations reported here. Instead, it could be general- 
ized for producing various TMD materials, both semiconductor (for 
example MoSe, or WTe2) and metal (for example NbSe, or TaS2), 
with precise layer control over a large scale. Indeed, our data show 
that, as an initial demonstration, monolayer TMD growth is possible 
on a variety of other technologically important substrates (such as 
Al,O3, SiN and HfO,) with the same growth conditions as those 
developed for SiO, (see Supplementary Fig. 16 for MoS, growth 
and device fabrication on these substrates, using these non- 
optimized conditions). Our versatile MOCVD growth therefore 
provides a new avenue for the growth, patterning and integration 
of multiple high-quality monolayer TMD films with different com- 
positions and electrical properties on a single substrate, enabling the 
future development of atomically thin integrated circuitry. 
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Precise interpolar phasing of abrupt climate change 


during the last ice age 


WAIS Divide Project Members* 


The last glacial period exhibited abrupt Dansgaard-Oeschger cli- 
matic oscillations, evidence of which is preserved in a variety of 
Northern Hemisphere palaeoclimate archives’. Ice cores show that 
Antarctica cooled during the warm phases of the Greenland 
Dansgaard-Oeschger cycle and vice versa”’, suggesting an inter- 
hemispheric redistribution of heat through a mechanism called the 
bipolar seesaw**. Variations in the Atlantic meridional overturn- 
ing circulation (AMOC) strength are thought to have been import- 
ant, but much uncertainty remains regarding the dynamics and 
trigger of these abrupt events’ °. Key information is contained in 
the relative phasing of hemispheric climate variations, yet the 
large, poorly constrained difference between gas age and ice age 
and the relatively low resolution of methane records from 
Antarctic ice cores have so far precluded methane-based synchron- 
ization at the required sub-centennial precision”*”°. Here we use a 
recently drilled high-accumulation Antarctic ice core to show that, 
on average, abrupt Greenland warming leads the corresponding 
Antarctic cooling onset by 218 + 92 years (2c) for Dansgaard- 
Oeschger events, including the Bolling event; Greenland cooling 
leads the corresponding onset of Antarctic warming by 
208 + 96 years. Our results demonstrate a north-to-south direc- 
tionality of the abrupt climatic signal, which is propagated to the 
Southern Hemisphere high latitudes by oceanic rather than atmo- 
spheric processes. The similar interpolar phasing of warming and 
cooling transitions suggests that the transfer time of the climatic 
signal is independent of the AMOC background state. Our find- 
ings confirm a central role for ocean circulation in the bipolar 
seesaw and provide clear criteria for assessing hypotheses and 
model simulations of Dansgaard—Oeschger dynamics. 

Net heat transport by the Atlantic branch of the global ocean over- 
turning circulation is northwards at all latitudes, resulting in a heat flux 
from the Southern Hemisphere (SH) to the Northern Hemisphere 
(NH)*. Variations in this flux act to redistribute heat between the 
hemispheres, a mechanism commonly invoked to explain abrupt 
sub-orbital climatic variability and the asynchronous coupling of 
Greenland and Antarctic temperature variations on these timescales”®. 
Millennial-scale AMOC variability is corroborated by North Atlantic 
proxies for deep-water ventilation and provenance that suggest 
decreased North Atlantic deep water production and the intrusion 
of southern-sourced water masses during stadial (that is, North 
Atlantic cold) periods’’”*. The oceanic instabilities are accompanied 
by shifting atmospheric transport patterns. Proxy data and climate 
models consistently indicate northward and southward migrations 
of the intertropical convergence zone in response to abrupt NH warm- 
ing and cooling, respectively'*"*. Abrupt NH events may also induce 
changes in the strength and position of the SH mid-latitude westerlies. 
Strengthened and/or southward-shifted westerlies during NH stadials 
have the potential to warm the Southern Ocean and Antarctica by 
enhancing the wind-driven upwelling of relatively warm circumpolar 
deep waters, providing a direct atmospheric pathway for the bipolar 
seesaw to operate’’. Climate model simulations further suggest that 
atmospheric readjustment in response to decreased North Atlantic 


deep water formation can induce inhomogeneous temperature 
changes over the Antarctic continent’®’, possibly by means of 
wind-driven changes in sea-ice distribution’. 

Atmospheric teleconnections operate on seasonal to decadal time- 
scales because of the fast response time of the atmosphere*’*”. By 
contrast, oceanic teleconnections can operate on a wide range of time- 
scales from decadal to multi-millennial, depending on the processes 
and ocean basins involved'?*". Climatic signals can be rapidly propa- 
gated from the North Atlantic to the South Atlantic via Kelvin waves’, 
but models suggest a more gradual (centennial timescale) propagation 
from the South Atlantic to the SH high latitudes as a result of the 
absence of a zonal topographic boundary at the latitudes of the 
Antarctic circumpolar current (ACC)'°”’. The timescale on which 
newly formed North Atlantic deep water is exported to the Southern 
Ocean is similarly around several centuries”. The bipolar seesaw rela- 
tionship observed between ice core records potentially bears the 
imprint of both atmospheric and oceanic teleconnections; precise con- 
straints on the interhemispheric phasing can help distinguish which 
mode dominates, and can also identify leads and lags’. 

Completed in December 2011, the West Antarctic Ice Sheet (WAIS) 
Divide ice core (WDC)”’ was drilled and recovered to a depth of 
3,405 m. Here we present results from the deep part of the core 
(Fig. 1), extending back to 68kyr before 1950 (before present; BP). 
The WDC climatic records show no stratigraphic disturbances (such 
as large-scale folds) that are commonly encountered near the base of 
ice cores, a fact we attribute to basal melting at the site that removes old 
ice before such disturbances can develop and that decreases shear 
stress by lubricating the bed. The Greenland NGRIP core similarly 
has strong basal melting and an undisturbed stratigraphy’. WDC 
5180 of ice, a proxy for local condensation temperature, exhibits clear 
millennial-scale variability as observed across the Antarctic continent 
(Fig. 1c, d). We evaluate the phasing of WDC millennial variability 
relative to 5'%O of the Greenland NGRIP core (Fig. 1a). For each 
Greenland Dansgaard—Oeschger (DO) event we can identify a corres- 
ponding Antarctic Isotopic Maximum (AIM) event’, although AIM 9 
is only very weakly expressed in WDC (we adopt the naming conven- 
tion whereby AIM x is concomitant with DO x). 

To investigate the phasing of the bipolar seesaw, we have synchro- 
nized WDC to the Greenland NGRIP core by means of globally well- 
mixed atmospheric methane (CH,; Fig. 1b)”*?. WDC allows CH, 
synchronization at unprecedented, sub-centennial precision as a 
result of the small difference between gas age and ice age (Aage) 
and continuous, centimetre-scale resolution CH, record (Extended 
Data Figs 1 and 2). Because uncertainties in the relative phasing of 
CH, and Greenland climate** are smaller than uncertainties in 
Greenland Aage, we synchronize WDC CH, directly to NGRIP 
5/80 (rather than to NGRIP CH,)”. 

As pointed out by several authors®”, a strong anti-correlation exists 
between Greenland 5!°O and the rate of change (that is, the first time 
derivative) of Antarctic 8'8O. This relationship is consistent with a 
simple thermodynamic model of the bipolar seesaw in which Antarctic 
temperature variations are moderated by the large thermal mass of the 
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Figure 1 | Records of glacial abrupt millennial-scale climatic variability. 

a, Greenland NGRIP 880 record! on GICC05 X 1.0063 chronology 
(Methods). b, WDC discrete CH, record on the WD2014 chronology, which is 
based on layer counting (0-31.2 kyr) and CH, synchronization to NGRIP 


Southern Ocean®. In evaluating this relationship at WAIS Divide 
(Fig. 2a, blue curve), we find the strongest anti-correlation at a NH 
lead of about 170 years (blue dot in Fig. 2a). This timing is distinct from 
the zero-year lead commonly assumed in mathematical descriptions of 
the bipolar seesaw. The analysis is repeated using an ensemble of 
4 X 10° alternative WDC chronologies (Methods), showing that the 
centennial-scale timing is a robust result (Fig. 2d) with an estimated 
uncertainty of 69 years (20 uncertainty bounds are used throughout 
this work). Evaluation of the cross-correlations using WDC CH, 
instead of NGRIP 5'%O gives almost identical results (Fig. 2a, green 
curve), in line with the notion that CH, is a good proxy for Greenland 
climate’. 

Next we analyse the DO-AIM coupling in more detail. The bipolar 
seesaw theory suggests that during Greenland stadial periods heat 
accumulates in the SH, causing gradual warming of Antarctica; during 
interstadial periods Antarctica is cooling. The abrupt stadial-intersta- 
dial transitions in Greenland are accompanied by a breakpoint in the 
WDC 8'%0 record, where the warming trend changes to a cooling 
trend. Although the timing of abrupt transitions can be established 
unambiguously in both CH, and Greenland 5'°O records, the smaller 
signal-to-noise ratio of Antarctic 5'°O time series complicates break- 
point identification for individual AIM events. To overcome this lim- 
itation, we stack and average the AIM events to detect their shared 
climatic signal (Methods and Extended Data Figs 3 and 4). We align 
WDC records at the midpoint of the DO CH, transitions, which is set 
to lag the midpoint in the NGRIP 5'°O transition by 56 + 38 years”. 
The relative timing of the WDC 8'*O and CH, curves is determined by 
Aage, which is the largest source of uncertainty (Methods). 

We find that Antarctic cooling is delayed relative to abrupt NH 
warming by 218+92years on average (Fig. 2b, e); similarly, 
Antarctic warming lags NH cooling by 208 + 96 years (Fig. 2c, f). 
The robustness of this result is demonstrated with a Monte Carlo 
analysis in which we randomly perturb the relative alignment of the 
individual AIM events, in combination with an ensemble of 4 X 10° 
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(31.2-68 kyr)’. c, WDC 8180 record. d, Antarctic temperature stack (ATS)*° in 
degrees Celsius relative to the present day on AICC12 X 1.0063 chronology. 
DO/AIM events are indicated with orange vertical bars, numbered at the 
bottom of the figure. 


alternative WDC chronologies. The timing of the WDC 8'%O break- 
point (orange dots in Fig. 2b, c) is determined by using an automated 
fitting algorithm (Methods and Extended Data Fig. 5). Performing the 
analysis separately on a stack of only the major AIM events (AIM 4, 8, 
12, 14 and 17), only the minor AIM events or stacks of eight randomly 
selected AIM events gives nearly identical phasing (Extended Data Fig. 
6), excluding the possibility that the result is dominated by the timing 
of a few prominent events. We repeat the stacking procedure for the 
WDC sea-salt sodium record (Extended Data Fig. 7), and find that this 
tracer, which has been interpreted as a proxy for sea-ice production”, 
changes almost synchronously with WDC 8'*O—possibly reflecting a 
common forcing by Southern Ocean temperatures’. 

At the onset of the NH Bolling warm period (DO 1; 14.6 kyr) we find 
a north-south phasing comparable to that of the glacial period, with a 
256 + 133-year lag of the Antarctic 8'°O response (Fig. 3 and 
Extended Data Table 1). The interpolar phasing at the onset and 
termination of the Younger Dryas stadial is ambiguous in our record 
(Extended Data Table 1), mostly because of the relatively gradual 
nature of the Younger Dryas onset and the presence of two local 
maxima in the WAIS-D 8'*O record around the Younger Dryas- 
Holocene transition (possibly reflecting regional climate). A recent 
study using a CH,4-synchronized stack of five near-coastal Antarctic 
cores spanning the deglaciation suggested a near synchrony of 
Antarctic breakpoints and Greenland transitions within a dating 
uncertainty of 200 years’. Our result overlaps with this range. 
However, the much smaller dating uncertainty at WDC and our use 
of the full sequence of glacial AIM events shows that a centennial-scale 
Antarctic lag is a systematic feature of DO-AIM coupling throughout 
the glacial period. 

The lead of NH climate revealed at WDC (Fig. 2) provides clues 
about DO climate dynamics. The simplest interpretation is that the 
abrupt phases of the DO cycle are initiated in the NH, presumably in 
the North Atlantic. This is consistent with several proposed DO 
mechanisms, such as North Atlantic sea-ice dynamics’, freshwater 
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Figure 2 | Interhemispheric phasing of the bipolar seesaw. a, Lagged 
correlation between NGRIP 5'8O and WDC d(8780)/d¢ (blue), and between 
WDC CH, and d(8!8O)/dt (green). Millennial-scale variability is isolated by 
using a fourth-order Butterworth bandpass filter (500-10,000-year window); 
the CH4-synchronized part of the records is used (31.2-68 kyr). b, DO 3-18 
stack of NGRIP 8'%O (blue), WDC CHy (green) and WDC 580 (orange with 
curve fit), aligned at the midpoint of the DO warming signal. Events are 
averaged with their original amplitudes and normalized after stacking for 


forcing* or ice shelf collapse’. We acknowledge, however, that the 
abrupt North Atlantic events could be the response to a remote process 
not visible in the ice core records; therefore, although we cannot ascer- 
tain the location of the elusive DO ‘trigger’ (if such a concept is even 
meaningful in a highly coupled dynamical system), our results clearly 
indicate a north-to-south directionality of the abrupt phases of the 
bipolar seesaw signal. 

The centennial timescale of the NH lead demonstrates the dom- 
inance of oceanic processes in propagating NH temperature anomalies 
to the SH high latitudes. Any atmospheric teleconnection would be 
manifested within at most a few decades after the abrupt NH event"®; 
on this timescale the WDC 8'°0 stacks (Fig. 2b, c) show no discernible 
temperature response above the noise. We estimate the noise level by 
subtracting the fitted curves from the 5'°O stacks; the remaining signal 
has a 2o variability of 0.14%, or about 0.18 °C assuming an isotope 
sensitivity of 0.8%0 K~'. We therefore estimate an upper bound of 
0.18°C on an atmospherically induced Antarctic temperature res- 
ponse (for comparison, millennial Antarctic temperature variations 
are on the order of 1-2 °C). A readjustment of atmospheric transport 
may induce spatially inhomogeneous temperature changes*'® that 
may be (partly) responsible for the heterogeneity in expression, or 
shape, of AIM events across Antarctica’. 

We find that on average the DO cooling signal is transmitted as fast 
to Antarctica as the DO warming signal is (our sensitivity study sug- 
gests a difference in propagation time of 10 + 89 years). This implies 
that the north-to-south propagation time is independent of the AMOC 
background state; that is, it is independent of whether the AMOC is in 
the weak or strong overturning state. Modelling work suggests that the 
meridional propagation time across the ACC latitudes depends prim- 
arily on ACC strength”’; our inference of similar propagation times 
may thus reflect a stability of the ACC on millennial timescales. There 
is also a conspicuous synchronicity between the phasing of the bipolar 
seesaw and the duration of the abrupt increase in CO, at 14.6 kyr 
(Fig. 3d); if the former does indeed reflect the timescale of the oceanic 
response, this may hold clues about the (unidentified) source of this 
CO, (ref. 26). 


NH lead time (years) 


convenience of visualization. c, As in b, but for NH abrupt cooling events (that 
is, the interstadial terminations). d—f, Histograms of NH lead time associated 
with a-c, respectively, generated by binning solutions from the sensitivity 
study. The total number of solutions is 4 x 10° in d, and 4 X 10° ine and 

f. Distribution mean and 2e probability bounds are listed in the panels. Shaded 
vertical yellow bars (upper panels) show NH lead time; the error bar represents 
20 as defined for the lower panels. 


Proxy records of North Atlantic ventilation and overturning 
strength during Marine Isotope Stage 3 typically show the most prom- 
inent excursions during Heinrich stadials'’"”, periods of extreme 
cold in the North Atlantic associated with layers of ice-rafted 
debris in ocean sediments that represent times of massive delivery of 
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Figure 3 | Timing of the last deglaciation. a, NGRIP 5'°O on GICC05 
chronology’. b, WDC CHy. c, WDC 880 with breakpoints as orange dots and 
error bars showing the 2¢ uncertainty bounds (Extended Data Table 1). 

d, WDC CO, data (dots) with spline fit (solid line)**. Period abbreviations: OD, 
Oldest Dryas; B-A, Bolling-Allerad; ACR, Antarctic Cold Reversal; YD, 
Younger Dryas; Holoc., Holocene. Vertical orange lines correspond to the 
midpoints of the WDC CH, transitions. NGRIP and WDC chronologies are 
both based on annual-layer counting, and are fully independent. 
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debris-laden icebergs (Heinrich events). This suggests that there may 
be a difference between climate dynamics during Heinrich and non- 
Heinrich stadials. We find no anomalous NH lead time for DO transi- 
tions directly after or before Heinrich stadials (Extended Data Fig. 6), 
and thus no evidence that Heinrich stadials are unusual from the 
perspective of the oceanic teleconnections that dominate the bipolar 
seesaw. Similarly, previous work has shown that an increase in 
Antarctic temperature is linearly related to Greenland stadial duration, 
irrespective of the occurrence of Heinrich events during these stadials’. 
Recent studies suggest that decreases in AMOC strength precede the 
Heinrich events, allowing the possibility that the latter are a response 
to AMOC variations rather than their cause’””*. The main influence of 
Heinrich events on the bipolar seesaw may thus be to lengthen the 
stadial periods during which they occur by suppressing the AMOC by 
means of iceberg-delivered freshwater, allowing large amounts of heat 
to build up in the Southern Ocean. 

Although both the data’ and the models'””' suggest a fast temper- 
ature response in the South Atlantic (decadal-scale in models), there is 
currently no consensus in existing model studies on the physical 
mechanisms and timescales of the processes that propagate temper- 
ature signals between the South Atlantic and the Southern Ocean; 
complicating factors to consider include the lack of a zonal boundary 
to support the propagation of Kelvin waves”, the large thermal inertia 
of the Southern Ocean’, the complexity of eddy heat transport across 
the Antarctic circumpolar current, and the coupling between the 
extent of Antarctic sea ice and the overturning circulation”’. In this 
context, our precise phasing observations provide a new constraint 
with which to test future model simulations seeking to capture the 
dynamics of millennial-scale climate variability. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Core recovery and processing. The location of the WAIS Divide ice core 
(79.48° S, 112.11° W) was selected to provide the highest possible time-resolution 
record of Antarctic climate during the past 50 kyr or more, and to ensure that the 
record from the last deglaciation would not be in ice from the lower-quality brittle- 
ice zone”. The site has a present-day ice accumulation rate of 22 cm yr’, divide 
flow andan ice thickness of 3,460 m (J. Paden, personal communication 2011). No 
stratigraphic irregularities were detected in the core using visual observation, 
multitrack electrical conductivity measurements or gas stratigraphy. Coring was 
stopped 50m above the bed to prevent contamination of the unfrozen basal 
environment. Drilling of the main core (WDCO06A) started during the 2006/ 
2007 field season and finished during the 2011/2012 field season, at the final depth 
of 3,405 m. Drilling was done with the US Deep Ice Sheet Coring (DISC) drill, a 
new drill designed and operated by the Ice Drilling Design and Operations group 
(engineering team at University of Wisconsin, Madison). Core handling and 
distribution of ice samples were performed by the National Ice Core Laboratory 
(US Geological Survey) and the Science Coordination Office (University of New 
Hampshire and Desert Research Institute). The drill and core handling methods 
had many innovations that improved core quality, especially in the brittle ice. 
These included improved mechanical characteristics of the drill, better control 
of the drilling operation, core handling procedures that minimized thermal and 
mechanical shock to core, netting to contain the brittle ice, and temperature 
tracking of all ice samples from the drill to the laboratories. 

Data description. Water '8O/'°O composition (5'°O) was measured at IsoLab, 
University of Washington, Seattle, Washington. Measurement procedures for 
WDC have been described elsewhere'”*'. In short, measurements were made at 
0.5 m depth averaged resolution, using laser spectroscopy (Picarro L2120-i water 
isotope analyser). 5'°O data are reported relative to the VSMOW (Vienna 
Standard Mean Ocean water) standard, and normalized to the VSMOW-SLAP 
(Standard Light Antarctic Precipitation) scale. 

Two separate data sets of atmospheric methane exist for WDC. The first is based 
on discrete sampling of the core; it was measured jointly at Penn State University 
(0-68 kyr, 0.5-2m resolution) and at Oregon State University (11.4-24.8 kyr, 
1-2m resolution). For this record the air was extracted using a melt-refreeze 
technique from ~50 g ice samples, and analysed on a gas chromatograph with a 
flame ionization detector*’. Corrections for gas solubility, blank and gravita- 
tional enrichment for the Oregon State University data were performed in 
accordance with ref. 33; corrections to the Penn State University data are described 
in ref. 17. A second data set (R. H. Rhodes, E. J. Brook, J.C. H. Chiang, T. Blunier, 
O.J. Maselli, J.R. McConnell, D. Romanini and J. P. Severinghaus, unpublished 
observations) is based on continuous flow analysis (CFA) in combination with 
laser spectroscopy***’. This data set, which is of higher temporal resolution, was 
used to determine the timing of the abrupt DO transitions (Extended Data Fig. 2). 
All CH, data are reported on the NOAA04 scale**. 

WDC Na concentrations were measured at the Ultra Trace Chemistry 

Laboratory at the Desert Research Institute, Reno, Nevada, by means of CFA. 
Sticks of ice were melted continuously on a heated plate. Potentially contaminated 
meltwater from the outer part of the ice stick was discarded. Two inductively 
coupled plasma mass spectrometers were used to analyse the stream of melt- 
water’. The CFA chemistry and CH, data were measured simultaneously on 
the same ice samples, and all measurements are co-registered in depth. 
Age scale and Aage. For WDC we use the WD2014 chronology, which is based on 
annual-layer counting down to 2,850 m (31.2 kyr Bp), and on stratigraphic match- 
ing by means of globally well-mixed methane for the deepest part of the core 
(2,850-3,405 m)**. The deep section of the WD2014 chronology has been syn- 
chronized to a linearly scaled version of the layer-counted Greenland ice core 
chronology (GICC05). Multiplying the GICC05 chronology by 1.0063 brings 
the ages of abrupt DO events as observed in Greenland ice core records in agree- 
ment with the ages of these events observed in the U/Th-dated Hulu Cave spe- 
leothem record (which provides better absolute age control)”*. The NGRIP data in 
Fig. 1a have similarly been placed on the 1.0063 < GICCO5 age scale; NGRIP data 
in Fig. 3a are plotted on the original GICC05 chronology. 

WDC permits precise interhemispheric synchronization for two reasons. First, 
as a result of the high accumulation rates, Aage remains relatively small. WDC 
Aage is about an order of magnitude smaller than Aage in cores from the East 
Antarctic Plateau, and about one-third of Aage at other coastal sites that cover the 
last glacial period (Extended Data Fig. 1a)’. As the uncertainty in Aage is pro- 
portional to Aage itself, this leads to smaller dating uncertainties at WDC. Second, 
recent technological developments in coupling laser spectrometers with ice core 
continuous-flow analysis™ have resulted in a WDC CH, record with the highest 
temporal resolution of all Antarctic ice cores so far. Extended Data Fig. 2c com- 
pares the WDC CH, record (grey, ~2-year sampling resolution) with the EDML 


CH, record (orange, ~100-year sampling resolution). The estimated uncertainty 
in WDC Aage (ref. 23) is shown in Extended Data Fig. 1b-e. 

Note that the transition between the layer-counted and CH,-synchronized 
sections of the WD2014 chronology at 31.2 kyr does not influence our analysis 
of the bipolar seesaw phasing. In the stacking procedure we align each of the 
individual Antarctic events at the midpoint of the associated DO CH, transition. 
In doing so, we effectively synchronize all the WDC events to NGRIP 880, not 
just the events between 68 and 31.2kyr. In performing the cross-correlation 
between NGRIP and WDC (Fig. 2a), we only analyse the CHy-synchronized part 
of the record. 

A dynamical version of the firn densification model*** is used to calculate the 
Aage of the WD2014 chronology”. We furthermore calculated WDC Aage using 
an alternative firn densification model’**””° (Extended Data Fig. 1b, blue curve). 
Using this alternative Aage scenario gives a NH lead time of 242 and 247 years for 
abrupt DO warming and DO cooling, respectively, in good agreement (within the 
lo uncertainty bound) with results presented in the text. 

Identification of abrupt DO transitions. We use the midpoint of the abrupt 
transitions as time markers to align individual events. The method for finding the 
midpoint is identical to that used in ref. 23, where further details can be found. The 
method is shown in Extended Data Fig. 2a, b for DO 17 and 16 (we use a recently 
proposed DO nomenclature*’). For each transition we define a pre-transition level 
and a post-transition level (horizontal orange markers), and use linear interpola- 
tion to find the depth at which the tracer of interest (5'*O or CH,) has completed 
25%, 50% and 75% of the transition from the pre-transitional to post-transitional 
value. The 50% depth (red dots) is used to define the timing of the abrupt event; the 
25% and 75% depths (blue dots) provide an uncertainty range on the midpoint”. 

It has been suggested that interhemispheric CH, synchronization is compli- 

cated by the fact that gas bubbles in ice cores have a gas age distribution, rather 
than a single age’. To investigate this effect at WDC, we construct a hypothetical 
gas age distribution (Extended Data Fig. 2d), using a (truncated) log-normal 
distribution as suggested elsewhere***’, in which we conservatively set o = 1 
and ju = In(50). The spectral width (or second moment) of this age distribution 
is 24.3 years; for comparison, the gas-age spectral width for present-day WDC firn 
is only 4.8 years“. Applying this filter to an atmospheric ramp in CH, (Extended 
Data Fig. 2e) shifts the midpoint of the transition (as recorded in the ice core) 
forwards by 5 years. Because we have conservatively chosen the age distribution to 
be very wide, this 5-year shift should be considered an upper bound. The influence 
of the gas age distribution on the transition midpoint identification is very small in 
comparison with other sources of uncertainty, and is neglected in the remainder of 
the manuscript. 
Stacking procedure. We define a time vector t that runs from time t = —1,200 to 
t = 1,200 in 1-year increments. Next, for each of the DO/AIM events, we set the 
midpoint of the NGRIP 5'0 transition to tf = 0, and use linear interpolation to 
sample the different records onto time vector t. For each individual DO/AIM 
event this results in a record that spans from —1,200 to +1,200 years relative to 
the abrupt transition, at annual time steps; we shall refer to these as the contrib- 
utory records. To align the WDC records we set the midpoint in the WDC CH, 
transition to t = 56 years, using the observation that the CH, transition slightly 
lags the Greenland 5'°O signal>***“” (the uncertainty in this time lag is evaluated 
in the sensitivity study). With all the individual events synchronized and 
resampled to identical time spacing, we can average them to obtain the stacked 
record. 

The spacing between DO events varies widely over time, ranging from several 
thousands of years to only hundreds of years. This means that in several cases the 
2,400-year window we use contains more than a single event. We crop the con- 
tributory records whenever an adjacent event occurs within the sampling window 
(Extended Data Fig. 3). For widely spaced (that is, long-duration) events such as 
DO/AIM 12, no adjacent events fall within the time window, and no cropping is 
required (Extended Data Fig. 3a). For closely spaced events, such as DO/AIM 17.1, 
we need to crop neighbouring events (Extended Data Fig. 3b). The cropped parts 
of the contributory records are replaced with constant values that equal the bound- 
ary values (50-year averages) of the uncropped part of the record. The number of 
contributory records available as a function of time is shown in the bottom half of 
Extended Data Fig. 4. The shortest contributory record is the AIM 17.1 record, 
which is 700 years long. 

Determining the breakpoint in the WDC 5'%0 stack. We use an automated 
routine that is similar to the BREAKFIT algorithm”, the main difference being 
the use of a second-order polynomial (rather than a linear) fit to the data, to 
account for the fact that the WDC 8180 stack is curved rather than linear. The 
algorithm finds the breakpoint that minimizes the root mean square deviation 
(r.m.s.d.) between the stack (that is, the data) and the polynomial fit in the —600 
to +700 years time interval (Extended Data Fig. 4); the interval was chosen 
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asymmetrically around zero to account for the fact that the breakpoint is always 
found at positive values. The routine operates as follows. 

First, the user selects the range in which the algorithm is to look for the break- 
point, as well as a time step. For example, if the user selects 0-400 years at a 100- 
year time step, the algorithm will test the possibilities t = 0, 100, 200, 300 and 
400 years. We shall refer to this one-dimensional array of values as the input 
vector. 

Second, at each of the values in the input vector, the 3180 stack is split into two 
pieces, and a second-order polynomial curve is fitted to data on each side sepa- 
rately. The two fitting curves are merged at their point of intersection. Note that 
the point of intersection can differ from the point where the data series were split in 
two. 

Third, for each fit thus obtained, we calculate the r.m.s.d. to the stacked data 
over the —600 to +700 years time interval, and we select the best fit. The point of 
intersection of the two curves, rather than the point where the data series were split 
into two (that is, the values in the input vector), is considered the breakpoint in the 
3'°0 stack. 

For most of the results in this work we let the algorithm search in the 0-400-year 
range, at a 2-year time step. The exception is the stacks of randomly selected DO 
events (Extended Data Fig. 6), for which we used a wider range (—100 to 
+500 years) to account for the larger spread in solutions. Note that the effective 
range of breakpoint detection is wider than the input range, because the point of 
intersection of the fitting curves is permitted to lie outside the range specified by 
the input vector. The fitting curves generated by the algorithm are plotted in dark 
orange lines in Fig. 2b, c and Extended Data Figs 4a, b, 6a-d and 7b, c; the break- 
point is indicated with an orange dot. The MATLAB code implementing the fitting 
procedure is provided in Supplementary Information. 

The reason for using a quadratic rather than a linear fitting procedure is that 
Antarctic temperatures do not change in a linear fashion either before or after the 
abrupt transitions. Asa test, we apply our fitting procedure to the WDC 5'°0 stack 
(Fig. 2b), using either linear or quadratic fitting functions, in which we change the 
size of the data window from 300 years to 1,600 years. The window is applied 
symmetrically, meaning that equal numbers of years are used before and after 
the detected breakpoint (800 years on each side for a 1,600-year window). Data 
falling outside this window are ignored in the fitting procedure. 

We find that the performance of the quadratic fitting procedure is superior in 

two important aspects (Extended Data Fig. 5). First, by using a quadratic fit, the 
breakpoint we detect is independent of the window size, which is not true for the 
linear fit. Second, the quadratic method provides a better approximation to the 
data, as expressed by the r.m.s.d. between the data and the fitting curve. At small 
window sizes of <700 years, both methods agree very well, and the fit to the data is 
comparable in terms of the r.m.s.d. This is due to the fact that at small window sizes 
the curvature of the time series is not very pronounced and is well approximated by 
a linear trend. We have also tested the BREAKFIT routine®’, which uses a linear fit 
on both sides of the breakpoint. We find a very good agreement between the 
breakpoints identified using the BREAKFIT routine and our linear fitting routine, 
with a mean difference of 0.2 years between them. Using a moving block bootstrap 
analysis**, the BREAKFIT algorithm suggests a 2o uncertainty of 50.8 years in 
breakpoint identification. 
Monte Carlo sensitivity study. To assess how the different uncertainties in our 
method affect our conclusions, we perform a Monte Carlo sensitivity study. The 
largest source of uncertainty is the difference between gas age and ice age, or Aage 
(ref. 49). Aage determines the relative timing of the WDC CH, and 8'°O records, 
and thereby the phasing we infer for WDC AIM events relative to Greenland 
climate”*”°. Increasing Aage shifts WDC ice ages towards older ages, thereby 
decreasing the NH lead time that we infer; vice versa, decreasing Aage will shift 
the ice ages younger, increasing the observed NH lead time. The Aage reconstruc- 
tion we use here is obtained from a dynamical firn-densification model***”*!"** 
constrained by 5'°N-N, data, a proxy for past firn-column thickness®. The firn- 
densification model results are in good agreement”’ with the alternative Adepth 
method developed recently”*. For our current purpose we use an ensemble of 1,000 
Aage histories (Extended Data Fig. 1b-e) generated by varying a suite of input 
parameters of the firn-densification model”. We furthermore use four different 
depth-age interpolation strategies***’, providing a total of 1,000 x 4 = 4,000 
alternative WDC chronologies. 

For each of these 4,000 potential WDC chronologies we further perform a 
Monte Carlo simulation (100 realizations) in which we perturb the alignment of 
the contributory records. The rationale behind the Monte Carlo study is that there 
is an uncertainty in the synchronization of the contributory records, and we want 
to assess how this uncertainty impacts our conclusions. In the Monte Carlo study, 
a random timing error is assigned to each of the contributory records, and thereby 
they are effectively shifted relative to one another in time. We distinguish between 
systematic and non-systematic errors, in which the former are correlated between 
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the different AIM events, and the latter are not. Sensitivity tests with the firn 
densification model show that Aage is a systematic error; if we have overestimated 
glacial Aage by, for example, 50 years, this will probably be true for all of the AIM 
events. We consider five independent sources of uncertainty in the synchroniza- 
tion procedure. 

First, the midpoint detection in the abrupt NGRIP 8180 transitions has an 
uncertainty; we shift the record by a value drawn from a Gaussian distribution 
of zero mean and a width that equals the midpoint uncertainty as defined above. 
This error is non-systematic. 

Second, as in the first point above, but for the midpoint in the abrupt WDC CH, 
transition. 

Third, although all four interpolation strategies synchronize the abrupt NH 
warming events between NGRIP and WDC, only two of them synchronize the 
NH cooling events”. For stacks of NH cooling we added an additional uncertainty 
drawn from a Gaussian distribution of zero mean and a width corresponding to 
the age difference between the cooling event as detected in NGRIP 5'°O and WDC 
CH4. This error is non-systematic. 

Fourth, a recent extensive analysis suggests a lag of CH, behind Greenland 5'°O 
of 56 + 38 years (20)”*. There are two types of uncertainty to consider: first, the 
CH, lag can differ between individual DO events (non-systematic error), and 
second, the central estimate of 56 years has an inherent uncertainty (systematic 
error). We here assume that both are equally large, and in the stacking procedure 
we assign both a 27-year systematic error and a 27-year non-systematic error to the 
CH, lag time. Both are drawn independently from a Gaussian distribution, and 
their combined uncertainty is therefore (277 + 277)'? = 38 years (that is, the 20 
uncertainty bound”). 

Fifth, the breakpoint fitting procedure has an intrinsic uncertainty. We estimate 
this by using the moving-block bootstrap analysis of the BREAKFIT algorithm’, 
which suggests a 1o uncertainty of 25.4 years (Extended Data Fig. 5a). We there- 
fore apply a 50.8-year (2c) systematic uncertainty (systematic because it applies to 
the full 8'°O stack rather than to individual events). 

Each contributory record is then individually shifted by an amount that corre- 
sponds to the sum of the randomly assigned non-systematic errors. After realign- 
ing the contributory records in this manner, the entire stack is shifted in time by an 
amount that corresponds to the sum of randomly assigned systematic errors. We 
detect the breakpoint by using the fitting algorithm. For each of the 4,000 chro- 
nologies we test 100 realizations of the Monte Carlo realignment, thus obtaining a 
total of 4 X 10° (1,000 X 4 X 100) estimates of the WDC 8'°O breakpoint. No 
statistical methods were used to predetermine the sample size used in the 
Monte Carlo analysis. The outcome of the sensitivity study is presented in the 
histograms of Fig. 2e, f and Extended Data Fig. 6e-h. 

The non-systematic errors, because they are independent of each other, accu- 
mulate slowly when added in quadrature. The systematic errors (such as the Aage 
uncertainty or the breakpoint fitting procedure) therefore dominate the final 
uncertainty estimate. Running the Monte Carlo experiment with Aage uncertain- 
ties withheld gives a 2c uncertainty bound on the interpolar phasing of 60 and 
68 years for the DO onset and DO termination stacks, respectively; using only 
Aage uncertainties gives a 2o uncertainty bound of 69 years. The Aage uncertainty 
is therefore as important as all other sources of uncertainty combined. 

Code availability. The MATLAB code used to stack the records and perform the 
Monte Carlo analysis are provided in Supplementary Information. 

Alternative stacks. To exclude the possibility that timing of the WDC 5'°O stack 
is dominated by a few prominent AIM events we perform the following tests. 

First, we distinguish between major AIM events that are very prominent in the 
record, and minor events. The major AIM events are AIM 17, 14, 12, 8 and 4; all of 
these are preceded by NH Heinrich events. These large events have historically 
been called A-events, and are clearly visible even in low-resolution Antarctic cores 
(AIM 8, 12, 14 and 17 have been referred to in earlier work as Al, 2, 3 and 4, 
respectively)**. The minor events are the remaining ones (AIM 3, 5.1, 5.2, 6, 7,9, 
10, 11, 13, 15, 16 and 18). When we compare a stack consisting of only the major 
AIM events with a stack of only the minor AIM events, we find no statistically 
significant difference in their timing (Extended Data Fig. 6a, b). The NH lead time 
for the major and minor AIM stacks is 230 + 93 and 211 + 94 years, respectively 
(Extended Data Fig. 6e, f); the timing of the full stack (218 + 92 years) is inter- 
mediate to these values, as expected (Fig. 2b). 

In a second experiment we randomly selected eight DO/AIM events in the 
stacking procedure for each of the 4 X 10° realizations of the Monte Carlo experi- 
ment; one such realization is shown in Extended Data Fig. 6c, d for abrupt 
NH warming and cooling, respectively. The NH lead time we find in this experi- 
ment is 210+ 113 years and 208+ 108years for abrupt NH warming and 
cooling, respectively (Extended Data Fig. 6g, h). These mean values differ by 
only a few years from those found for the full WDC 8'%0 stack (Fig. 2). Note that 
the distribution width of NH lead times in this experiment is wider than the 
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distribution width found for the full 5'°O stacks. The reason for this may be 
threefold: first, a stack with fewer contributory records has a smaller signal-to- 
noise ratio, making the breakpoint detection less certain; second, if the NH lead 
time differs slightly between the AIM events, the average timing of a randomly 
selected subset may differ from the average timing of the full set; and third, in the 
Monte Carlo procedure the contribution of non-systematic uncertainties scales 
roughly in proportion to /N, again suggesting a narrower distribution for higher N 
(the number of events in the stack). The wider distribution found on decreasing 
the number of events in the stack therefore does not represent a more realistic 
uncertainty bound on the phasing of the bipolar seesaw. 

Stacking the WDC Na record. Last, we investigate the phasing of the WDC sea- 
salt Na (ssNa) record relative to Greenland and Antarctic climate. The sea-ice 
surface is a major source of sea-salt aerosols to the atmosphere around Antarctica, 
and consequently ssNa has been interpreted as a proxy for the production and 
extent of regional sea ice*°*°. Sea-salt Na is strongly anti-correlated with 
Antarctic climate on both orbital and millennial timescales*”. 

Measured ssNa concentrations can be converted to a ssNa flux by multiplying 
with the accumulation rates at the site. On the East Antarctic plateau, the Na flux is 
expected to provide a better metric for atmospheric concentrations, because dry 
deposition dominates as a result of the very low accumulation rates”. 
Accumulation rates are much higher at WAIS Divide, however, and Greenland 
ice cores provide a better analogue than those on the East Antarctic plateau. It has 
been shown at Greenland Summit® that sea-salt concentrations in the ice reflect 
the atmospheric variations within 5%, whereas the sea-salt flux underestimates the 
atmospheric variations by about 50%. Thus, the observed variations in WDC ssNa 
concentrations should to first order reflect variations in the atmospheric loading. 

We investigate the relative timing of the ssNa record with the same tools as those 
that we applied to the WDC 8780 record. An evaluation of the lagged correlation 
between NGRIP 8180 and the time derivative of the various WDC records is 
shown in Extended Data Fig. 7a. For ssNa we obtain a centennial-scale NH lead 
time, comparable to that observed for WDC 8180. Stacking of the WDC impurity 
records yields the same picture (Extended Data Fig. 7b, c); the breakpoint in the 
ssNa stacks coincides with the breakpoint in the WDC 8180 stack, and lags the 
abrupt NH transition by roughly two centuries. Note that because 8'O and ssNa 
are both measured in the ice phase, the uncertainty that we estimate for the timing 
of the 8'8O breakpoint (2c of about 90 years) also holds for the impurity records. 

The synchronicity of WDC ssNa and 8'°O variations suggests that Antarctic 
climate and sea-ice extent are closely linked on centennial, or perhaps even sub- 
centennial timescales. This may reflect a common forcing and/or a feedback 
between Southern Ocean surface temperatures and sea-ice extent. The sea-ice 
extent may furthermore be important in forcing Antarctic temperatures (and 
8180)5°2, particularly at the WAIS Divide site, where the marine influence is 
stronger than on the East Antarctic plateau'’. 
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Extended Data Figure 1 | Difference between gas age and ice age (Aage) at 
WAIS Divide. a, Comparison of WDC Aage with other Antarctic cores. Ice 
core abbreviations: EDC, EPICA Dome Concordia; EDML, EPICA Dronning 
Maud Land; TALDICE, Talos Dome; WDC, WAIS Divide. Aage values are 
taken from refs 23, 63-65. The vertical axis is on a logarithmic scale. b, Aage 
uncertainty bounds obtained from an ensemble of 1,000 alternative Aage 
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scenarios; details are given elsewhere”. A Aage scenario obtained with an 
alternative densification model (ref. 39 instead of ref. 38) is shown in blue. 
c-e, Histograms of the 1,000 Aage scenarios at 20 kyr Bp (c), 40 kyr Bp 

(d) and 60 kyr Bp (e); stated values give the distribution mean + the 2¢ 
standard deviation. 
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Extended Data Figure 2 | Determining the timing of the abrupt DO 
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are indicated with blue dots. c, Comparison of WDC CH, (grey) with EDML 


transitions. a, b, DO 17.2, 17.1, 16.2 and 16.1 (from oldest to youngest*') as 

recorded in NGRIP 8780 (a) and WDC CH, (b). Horizontal orange bars denote 
pre-transition and post-transition levels; the transition midpoint (50% of signal 
amplitude) is indicated by a red dot; the 25% and 75% signal amplitude markers 


atmospheric record in the firn column. 
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CH, (orange)***. d, Hypothetical gas-age distribution for WDC due to firn 
densification and gradual bubble closure, using a truncated log-normal 
distribution”. e, Shift in transition midpoint induced by filtering of the 
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Extended Data Figure 3 | Cropping of individual records in the stack to 
prevent overlap of events. a, DO/AIM 12, where no cropping is needed. 


b, DO/AIM 17.1, where the most cropping is needed. Full time series with five- 


point running average are plotted in grey, and the contributory records are 
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plotted in blue and orange for NGRIP and WDC, respectively. The yellow 
vertical shading bar in background shows the NH lead time (200 years); the 
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purple rectangle gives the —1,200 to +1,200 time window. 
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Extended Data Figure 4 | Number of records and fitting procedure. 
Number of contributory records to the WDC 8180 stacks for abrupt NH 
warming (interstadial onset) (a) and for abrupt NH cooling (interstadial 
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termination) (b). Blue rectangles indicate the time window over which the 
fitting procedure evaluates the fit to the data (—600 to +700 years); shaded 
vertical yellow bars show NH lead time. 
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Extended Data Figure 5 | Evaluating the performance of the breakpoint 
detection algorithm. a, Breakpoint detection as a function of data window size 
using both linear (blue) and quadratic (orange) functions, compared with the 
BREAKFIT algorithm® (grey dots with 1o error bars). The data window is 


applied symmetrically, meaning that equal numbers of years (half the window 
size) are used before and after the detected breakpoint. Data falling outside this 
window are ignored in the fitting procedure. b, Root mean square deviation 
between the WDC 8/80 stack and the fitting curve. 
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Extended Data Figure 6 | Alternative stacking of AIM events. a, Stack of 
NGRIP 87%0 (blue), WDC CH, (green) and WDC 8180 (orange with fit) for 
just the major AIM events (4, 8, 12, 14 and 17), aligned at the abrupt NH 
warming. b, As ina, but for only the minor AIM events (3, 5.1, 5.2, 6, 7,9, 10, 11, 
13, 15, 16 and 18). c, As in a, but for eight randomly selected DO/AIM events. 
d, As in c, but aligned at the abrupt NH cooling. Events are averaged with their 


original amplitudes and normalized after stacking for convenience of 
visualization. e-h, Histograms of NH lead time associated with 

a-d, respectively, generated by binning the 4 x 10° solutions from the 
sensitivity study. The distribution mean and 26 uncertainty bounds are listed in 
the panels. Shaded vertical yellow bars (upper panels) show NH lead time. 
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Extended Data Figure 7 | Timing of sea-salt sodium. a, Lagged correlation 
between NGRIP 5!°0 and the time derivative of WDC 8'8O (orange), and 
between NGRIP 8'%O and the time derivative of WDC ssNa (grey). The dots 
indicate the maximum (anti-)correlation at 167-year and 229-year NH lead for 
WDC 53'%0 and ssNa, respectively. A fourth-order Butterworth bandpass filter 
with a 500-10,000-year window is applied to the time series to isolate the 
millennial-scale variability. b, DO3-18 stack of NGRIP 5180 (blue), WDC CH, 
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(green), WDC §'*0 (orange) and WDC ssNa (grey), aligned at the midpoint of 
the DO warming signal. The estimated breakpoint in the stacks (dots) occurs at 
t = 218 and 195 years for WDC 5180 and ssNa, respectively. c, As in b, but for 
the abrupt NH cooling events, with the estimated breakpoint at t = 208 and 
199 years for WDC 3'°O and ssNa, respectively. Shaded vertical yellow bars 
show NH lead times. 
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Extended Data Table 1 | Phasing of the bipolar seesaw during the last deglaciation 


Transition WDC 8"20 WDC CH, Aage Greenland-—CH, Lead of NH 
breakpoint Transition phasing climate 
OD -B/A 14,350 + 110 14,576 + 29 228 + 62 30 + 32 256eeslSsS 
B/A-YD 12,778 + 80 12,764 + 53 218 + 68 30432 16+122 
12,881 + 30T 216 +67 30432 133 +113T 
YD - Holoc. 11,842 + 904 11,542 + 33 AG)S az 5 8X0) az 3) -270 + 1144 


For the deglaciation we use a Greenland-CH, phasing that is the average of the 56 + 38-year CH, lag observed for the glacial period***°, and the near-synchronous phasing (4.5 + 24-year CH, lag) observed for the 
Balling transition®. All stated errors represent 2o uncertainty bounds; uncertainty in the WDC 8!80 breakpoint is determined using the BREAKFIT algorithm’; uncertainty in the CHa transition is defined as the 
25-75% range of the CH, transition (Extended Data Fig. 2); Aage uncertainty is found by using a firn densification model sensitivity study** (Extended Data Fig. 1); uncertainty in the Greenland-CH, phasing is 
taken to be the root mean square of the uncertainties in the cited studies****; the uncertainty in the NH climate is the root sum square of the uncertainties in the preceding columns. The potential role of COs in 
delaying Antarctic cooling at the Balling onset is discussed in ref. 69. 

+ This estimate uses the onset, rather than the midpoint, of the Balling-Allerad to Younger Dryas transition in WDC CH, in determining the phasing. 

This estimate may be unreliable because of a double peak in WDC 81°0 at about this time that may reflect local climate, as well as a unique and pronounced accumulation anomaly starting at about 11.9 kyr Bp. 
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Isotopic evidence for biological nitrogen fixation by 
molybdenum-nitrogenase from 3.2 Gyr 


Eva E. Stiieken', Roger Buick', Bradley M. Guy*+ & Matthew C. Koehler! 


Nitrogen is an essential nutrient for all organisms that must have 
been available since the origin of life. Abiotic processes including 
hydrothermal reduction’, photochemical reactions’, or lightning 
discharge’ could have converted atmospheric N, into assimilable 
NH,*, HCN, or NO, species, collectively termed fixed nitrogen. But 
these sources may have been small on the early Earth, severely limiting 
the size of the primordial biosphere’. The evolution of the nitrogen- 
fixing enzyme nitrogenase, which reduces atmospheric N, to organic 
NH,", thus represented a major breakthrough in the radiation of 
life, but its timing is uncertain**. Here we present nitrogen isotope 
ratios with a mean of 0.0 + 1.2% from marine and fluvial sedimen- 
tary rocks of prehnite-pumpellyite to greenschist metamorphic grade 
between 3.2 and 2.75 billion years ago. These data cannot readily 
be explained by abiotic processes and therefore suggest biological 
nitrogen fixation, most probably using molybdenum-based nitro- 
genase as opposed to other variants that impart significant nega- 
tive fractionations’. Our data place a minimum age constraint of 
3.2 billion years on the origin of biological nitrogen fixation and sug- 
gest that molybdenum was bioavailable in the mid-Archaean ocean 
long before the Great Oxidation Event. 

Nitrogenase is present in approximately 15% of all sequenced pro- 
karyotic genomes, including several phyla of Bacteria and one phylum 
of Archaea, but it is absent in Eukaryota*. The most common form (Nif) 
contains molybdenum in the catalytically active site, but a subset of 
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diazotrophic (N>-fixing) organisms produce ‘alternative nitrogenases’ 
containing V (Vnf) or Fe (Anf)’ instead. It was once thought that Vnf 
and Anf evolved before Nif, possibly in the Archaean era when seawater 
was enriched in Fe and depleted in Mo*”®, but newer phylogenetic evi- 
dence indicates that Nif is ancestral to the alternative isozymes”” and 
may itself have been preceded by a ‘proto-nitrogenase’ enzyme with low 
efficiency and substrate specificity’’. Protein substitution rates’, com- 
bined with the view that marine Mo was scarce before the Great Oxi- 
dation Event (GOE)", have been interpreted as indicating a relatively 
late origin of Mo-nitrogenase around 1.5-2.2 billion years (Gyr) ago”. 
However, other Mo-based enzymes, in particular the molybdopterin 
cofactor, have been traced back phylogenetically to the last universal 
common ancestor’, implying that some Mo was bioavailable in the 
early Archaean. Furthermore, abiotic nitrogen sources may have become 
limiting long before the GOE’*, making it conceivable that Mo-based 
nitrogenase is more ancient than models suggest. 

Nitrogen isotopes of sedimentary rocks can clarify this issue because 
metabolic nitrogen pathways exert characteristic fractionations that can 
be preserved in kerogen over geological time. Moderately positive nitro- 
gen isotope ratios (8°N = [(1°N/ MN etanial (°N/"*N) standard] — 1) from 
the late Archaean (2.7-2.5 Gyr) indicate an aerobic nitrogen cycle involv- 
ing nitrification and denitrification’*™*. It is widely assumed that before 
about 2.7 Gyr, the nitrogen cycle was anaerobic", consisting only of nitro- 
gen fixation and ammonification. As the latter has negligible isotopic 


Metamorphosed bulk 
A Fortescue fluvial bulk 


© Witwatersrand marine kerogen 


© Witwatersrand fluvial bulk 


-10 rt et a or oo oo oT od oP oo on oo: 


305 2 
205 
= ‘| 
ze <2 - 
pod xa 
“© 1074 o% 
| 3 . 
O--=--=-e%s- 
q ° 
H x 
4.0 3.5 3.0 2.5 


Figure 1 | Sedimentary 5'°N through time. All points labelled ‘Soanesville, 
‘Witwatersrand’, “Nullagine’, and ‘Fortescue’ are from this study, all others 
are compiled from the literature. ‘Hydrothermal’ refers to samples from 
hydrothermal veins; ‘Metamorphosed’, metamorphosed above greenschist 
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facies; “Witwatersrand’, Witwatersrand Supergroup. Insert shows data from 
this study at higher resolution; note that Soanesville Group data at 3.2 Gyr 
represent 20 superimposed points. See Supplementary Information for 
references. 
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Table 1 | Sample localities and depositional environments 


Unit Age (Gyr) Depositional setting MG 
Soanesville Group 3.19 Upper continental slope PP 
Nullagine Group 2.98 Outer shelf GR 
Witwatersrand 2.92 Fluvial GR 
Supergroup 2.87 Shallow marine, near rivers GR 

2.96 Marine shelf GR 
Fortescue Group 2.76 Fluvio-lacustrine PP 


LETTER 


n 57° Noutk (%0) TN (p.p.m.) 5°°Cong (Y6o) TOC (%) 
20 +0.1+05 247 + 146 —31.0+03 13+05 
1* +0.1+04 42+3 -40.7 +0.0 0.2+0.0 
8 +0.9+19 65 +63 -418+7.1 0.2+0.2 
5 +1.2+1.0 47 +18 -37.8+41.2 0.5+0.2 
5 -16+08 105 +48 -28.2+09 0.5+0.5 
13 -0.7+08 22+7 —36.0+53 0.1+01 


MG, metamorphic grade: PP, prehnite-pumpellyite; GR, greenschist. n, Number of samples. Isotopic uncertainties are reported as lo. 
* For the Nullagine sample, the reported standard deviation is equal to the analytical uncertainty of triplicate analyses. 


effects (¢ = 1%o)'*, the isotopic signature of biological nitrogen fixa- 
tion should be discernible in the earlier Archaean rock record. Nif can 
impart fractionations down to —4%o in fixed nitrogen relative to atmo- 
spheric No, as seen in some thermophilic organisms, but most marine 
observations are in the range —2%o to +1%o (average —1%o)”"”"*. The 
less efficient alternative nitrogenases Vnf and Anf fractionate nitrogen 
isotopes by —6%bo to — 8%o (ref. 7). The fractionation imparted by poten- 
tial proto-nitrogenases is unknown, but given their presumed low effi- 
ciency and serendipitous filling of their catalytic sites by various metals, 
including Fe or V (ref. 9), fractionations may have been relatively large, 
close to values observed for Vnf and Anf, and probably quite variable. 
The composition of atmospheric N, has evidently been between —2%o 
and +2%bv since the early Archaean’®. Hence, if 8!°N values of Archaean 
sediments fall consistently within a narrow range, then they may reveal 
if nitrogenase had evolved and which isozyme was dominant. However, 
published 8'°N data from early and mid-Archaean rocks are widely 
scattered (—3.8%o0 to +12.7%o; Fig. 1), partly because of metamorphic 
alteration to above greenschist facies, metasomatism by ammoniated 
brines, hydrothermal input during deposition, or high analytical uncer- 
tainty (ref. 15 and Supplementary Information 1). Existing data are thus 
inconclusive as to the presence of microbial nitrogen fixation. 

To place a better constraint on the origin of nitrogenase, we analysed 
nitrogen and carbon isotope ratios in mudrocks of prehnite-pumpel- 
lyite to greenschist metamorphic grade from marine and non-marine 
environments between 3.2 and 2.75 Gyr (Table 1 and Supplementary 
Information 2). The samples showed no evidence of post-depositional 
fluid alteration. Contributions of detrital nitrogen were minor and limited 
to fluvial samples (Supplementary Information 3.1 and Extended Data 
Fig. 1). We analysed decarbonated whole-rock samples and a few kerogen 
isolates by elemental analyser—isotope ratio mass spectrometry (ref. 20 
and Methods). Unlike cherts, which were commonly used in previous 
studies, mudrocks are relatively kerogen-rich and widespread, making 
them ideal for obtaining accurate data over long timescales. Phyllosilicate 
minerals can accumulate NH, released from kerogen during diagenesis. 
Therefore, whole-rock samples containing both organic nitrogen and 
mineral-bound ammonium should come closest to preserving the ori- 
ginal isotopic composition of biomass (Supplementary Information 3.1), 
if infiltration of fluids from other sources can be ruled out. To detect 
possible alteration, as well as to quantify the availability of Mo, V, and 
Fe in the depositional environments, we determined major and minor 
element abundances by inductively coupled plasma-atomic emission 
spectrometry and inductively coupled plasma—mass spectrometry (Sup- 
plementary Tables A3 and A4). 

Our results for marine samples (n = 26) showed a narrow range 
(—2.8%0 to +0.7%b; Fig. 2) for mid-Archaean 3)°Npuk With an average 
of —0.2 + 0.9%0. Most notably, 20 samples from the 3.2 Gyr Soanesville 
Group, collected from separate drill-cores spanning several hundred 
metres of stratigraphy, almost overlapped, ranging from —0.9%v to +0.7%o 
(mean = +0.1 + 0.5%o). Fluvial samples from the Fortescue Group 
and the Witwatersrand Supergroup (n = 26) had a similar mean value 
(+0.2 = 1.5%o); only four samples were heavier than + 1%o. We included 
marine Witwatersrand sediments deposited proximal to river inflows 
in the fluvial category, because they were probably influenced by fresh- 
water runoff. Their organic carbon isotopes (Table 1) were similar to the 
fluvial facies, supporting this reclassification. The four heavier 5° Noutk 
values in fluvial samples from the Witwatersrand basin may reflect local 


redox processes, consistent with sulphur isotope data indicating mild 
oxidative weathering on land at this time”’. Apart from these four 
points, all our data fell within the range of Mo-based nitrogen fixation 
(Fig. 2). This is also true if 3'°N values from the Witwatersrand Super- 
group and Nullagine Group increased by 1-2%vo during greenschist 
metamorphism”. The 17 kerogen isolates were lighter than whole rocks 
(by — 1.4% to — 4.1%; Extended Data Table 1), consistent with diagenetic 
alteration under anoxic conditions (Supplementary Information 3.1). 
Nitrogen isotopic composition was independent of total organic car- 
bon content (TOC; Fig. 2) and hence was probably not influenced by 
productivity, because sedimentation rates, the other control on TOC 
content, are unlikely to be highly variable in similar environments such 
as within the Soanesville Group. Elemental abundances in the Soanesville 
Group and the Witwatersrand Supergroup, including concentrations 
of Mo and V, were generally close to crustal averages (Fig. 3 and Ex- 
tended Data Fig. 2). This pattern argues against significant metasomatic 
overprinting, 

The narrow range in 5'°Npu Close to the presumed composition of 
the atmosphere is inconsistent with an excess supply of fixed nitrogen 
from abiotic sources, because partial assimilation of either NH," or NO, 
species would have resulted in substantial fractionations of greater than 
10%o (ref. 16). External sources of fixed nitrogen can therefore only be 
reconciled with our data if assimilation was quantitative, and if those 
fixed nitrogen species had an initial 8'°N composition near 0%o. It is 
unlikely that abiotically fixed nitrogen from the atmosphere would fulfil 
this condition. Organic nitrogen compounds formed during electric 
discharge experiments are fractionated relative to N2 by —15%o to 
—25%o, and photodissociation of N. by ultraviolet light leads to even 
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Figure 2 | Whole-rock 5'°N versus TOC. Symbols are as in Fig, 1. Error bars 
are lo of two or three replicate analyses of the same decarbonated sample 
aliquot. Dark blue shading, range of 5'°N values expressed in cultures that use 
the Mo-nitrogenase Nif; light blue, extension of this range seen in some wild 
types and thermophilic organisms using Nif; green, range observed in cultures 
using pure Vnf or Anf”’”"*. Almost all data points plot within the range of 
Mo-based nitrogen fixation. Outliers in fluvial sediments may reflect local 
redox chemistry. 
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Figure 3 | TOC and 5'°N versus Mo and V enrichment factors. a, TOC; 
b, 5'°N. Enrichment factors are calculated relative to average upper continental 
crust (Methods). All values are relatively close to 1.0, indicating insignificant 


larger fractionations”. NO, species produced by lightning were close 
to 0%o in one laboratory experiment”’, but theoretical models and 
measurements of NO, in the atmosphere indicate values between 
—5%p and — 15%b0 (ref. 24), making this mechanism incompatible with 
our data. A significant hydrothermal NH," flux can also be ruled out, 
because it would not have been accessible in fluvial settings such as 
those of the Fortescue and argillaceous Witwatersrand rocks, and the 
similarity in 8'°Nyux values between fluvial and marine samples sug- 
gests that nitrogen was supplied by the same mechanism. Another 
possible explanation for the 8!°Npux values could be nitrogen fixation 
by Vnfand/or Anf to —6%o, followed by a scenario similar to the modern 
ocean, namely quantitative nitrification, then partial denitrification in 
the water column imparting a positive fractionation to residual nitrate, 
and finally nearly quantitative assimilation of that nitrate into bio- 
mass. Today, this process imparts a positive fractionation to the result- 
ing biomass of 5-7%o compared with the initial fixed nitrogen. This 
could conceivably result in 3)°Npuk Values around 0%o if Vnf and/or 
Anf were the dominant nitrogenases. In that case, however, our data 
should show a larger spread, as seen in the modern ocean (—2%o to 
+15%o) where the extent of N> fixation and denitrification varies in 
response to climatic zonation” and ocean circulation’®. Thus, given 
the small variance around 0%o0 (¢ = 1.2%) shown by samples from 
different environments over a long time interval (450 Myr), this 
explanation seems unlikely. In particular, a standard deviation of only 
0.5%o among the Soanesville Group samples suggests a single simple 
fractionation mechanism. 

Overall, our data point to an ecosystem where the supply of nitrogen 
was limited but sufficient to sustain productive microbial ecosystems. 
The most plausible explanation for nitrogen acquisition is therefore bio- 
logical N2 fixation. Mo was probably the dominant catalyst, indicated 
by the small variance of 3!°’Npunk Values around 0%o that is especially 
noticeable in the Soanesville Group; scatter may be due to varying Fe 
availability'’. The alternative nitrogenases Vnf and Anf, and probably 
also potential precursor enzymes with low catalytic efficiency, should have 
caused larger or more variable fractionations than those observed. Modern 
Mo-dependent diazotrophs are capable of fixing N. down to approxi- 
mately 1nM [Mo], or about 1% of modern marine concentrations”. 
These low levels may have been supplied to mid-Archaean rivers and 
oceans by hydrothermal systems or localized oxidative weathering”’*””. 
Mo uptake may have been further enhanced by molybdophores”, if 
they had already evolved. Thus, the absence of appreciable Mo enrich- 
ments in these sediments (Fig. 3) does not preclude Mo bioavailability 
(Supplementary Information 3.2). 

Our results have several important implications. First, the nitrogen 
cycle was predominantly anaerobic before 2.7 Gyr. Any biogenic O2 
(ref. 29) must therefore have had a short lifetime, and oxidative weath- 
ering”* was probably restricted to environments where O, was locally 
produced. Second, nitrogenase is evidently a more ancient enzyme than 
recent models suggest*. Our data from the Soanesville Group constrain 
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authigenic enrichments of V or Mo in sediments. No trends are observed, 
indicating that availability of Mo and V did not influence either TOC or 8'°N. 


the minimum age to about 3.2 Gyr and indicate that Mo was the most 
widely used cofactor. If so, Mo must have been sufficiently bioavailable 
in the mid-Archaean to serve as an enzymatic catalyst. This supports 
phylogenetic evidence for the early evolution of other Mo-containing 
enzymes'*. Discrepancies with recent studies suggesting a Proterozoic 
origin of nitrogenase” may be due to either miscalibration of molecular 
clocks or lateral gene transfer that was unaccounted for in phylogenetic 
analyses. Lastly, the early evolution of Mo-nitrogenase and efficient 
nitrogen fixation would have averted the ‘nitrogen crisis’ predicted to 
have occurred in the Archaean or early Palaeoproterozoic*’, thus allow- 
ing biological evolution and proliferation to proceed apace. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Sample preparation. Sample preparation followed established protocols”. First, 
all outer surfaces were cut or polished off, then the rocks were crushed into sub- 
centimetre chips and sonicated sequentially in hexane, methanol, and deionized 
water (18 MQ) for a few minutes each to remove any modern contaminants. The 
chips were dried at 60 °C in an oven overnight and then pulverized in an aluminium 
oxide ceramic puck mill. The mill was cleaned with pre-combusted sand, deionized 
water, and methanol in between samples. Rock powders were de-carbonated with 
6N HCl for 3 days at 60 °C, where the acid was refreshed after 1 day and stirred 
after 2 days. The de-carbonated samples were washed with deionized water three 
times to remove residual acid and dried at 60 °C. All glassware used during sample 
preparation and storage was combusted in a muffle-furnace at 500 °C overnight. 

Kerogen isolates were prepared following the HF-BF; maceration protocol of 
ref, 31. Five to six grams of untreated rock powder were weighed into a 250 ml Teflon 
bottle, mixed with 100 ml deionized water (18 MQ) and 100 ml concentrated HF 
(reagent grade), and placed into a heated shaking water bath at 55 °C overnight. 
The next day, the HF was decanted after centrifugation for 45 min. Then a mixture 
of 62.5 g H3BO; (reagent grade), 100 ml HO, and 100 ml HF was added to each 
sample and placed back into the heated shaking water bath overnight. This mixture 
results in the formation of BF3, which efficiently removes fluoride precipitates*’. 
The BF; was decanted the following day after centrifugation, the sample was washed 
three times with deionized water, and then transferred with about 10 ml of deio- 
nized water into a scintillation vial. The vials were then placed into a freezer over- 
night and desiccated with a freeze-drier over the following day. Freezing the liquid 
first minimizes loss of kerogen by splashing during evacuation in the freeze-drier. 
Reproducibility of kerogen extraction was tested with an in-house rock standard 
from the 2.5 Gyr Mount McRae Shale (UW-McRae) and was around 0.1%o for 
3!°Nxe, We were not able to extract kerogen from all of our samples owing to limi- 
tations in the amount of sample available and their sometimes low TOC content. 
Isotopic analyses. All analyses were done with an elemental analyser (Costech ECS 
4010) coupled to a continuous-flow isotope-ratio mass spectrometer (Finnigan 
MAT253) via a ThermoFinnigan Conflo III'*?°*. Powders were weighed into 
9mm X 5 mm tin capsules (Costech) and flash-combusted with 10 ml O, (‘macro’ 
setting) at 1 bar and 1,000 °C. Resulting N2 and CO; gases were purified with Cr.03 
(conversion of trace CO to CO) at 1,000 °C, silvered Co30, (capture of sulphur 
gases and halogens) at 1,000 °C, and Cu (conversion of trace NO, to N2) at 650 °C. 
Traces of H2O were scrubbed with Mg(ClO,)2 at room temperature (20-25 °C). N2 
and CO, were then separated chromatographically at 40 °C and analysed sequen- 
tially. For analyses of kerogen isolates, CO2 gas was scrubbed from the samples with 
soda lime at room temperature, which allowed combustion of larger amounts of 
kerogen without any risk of saturating the detector cups in the mass spectrometer. 
Kerogen was mixed with VO, in a 1:1 ratio by mass to enhance combustion effi- 
ciency, although subsequent tests showed that samples also combusted well with- 
out an additional combustion aid (discussed below; Extended Data Fig. 3). 

All measurements were corrected for instrumental mass bias with two prev- 
iously calibrated in-house standards (glutamic acid GA-2 and dried salmon SA, 
calibrated against USGS41 and USGS40 (ref. 33)), which were analysed after each 
set of five samples. Accuracy and long-term reproducibility were monitored with 
a second glutamic acid standard (GA-1) and the UW-McRae rock standard. In all 
runs, accuracy for 5'°N was better than 0.3%o (average absolute value 0.1%o) and aver- 
age long-term precision over 12 months was also 0.3%o. For 3° Cogs the average 
long-term accuracy and average precision were both 0.1%o. Samples were analysed 
in two or three replicates of the same decarbonated powder aliquot; standard 
deviations are provided in Supplementary Table A2. Results are expressed in delta 
notation (S°N = {0°N/"*N]sampte! (’°N/4N] standard ~~ 1}; 3° Corg = {Ci C] pattie! 
[°C/"*C] standard — 1}); reference standards were atmospheric Nj for nitrogen iso- 
topes and Vienna PeeDee Belemnite for carbon isotopes. N and C concentrations 
were reproducible with average precision of 1.7% and 1.2%, respectively. All whole- 
rock samples and kerogen isolates were analysed in duplicates or triplicates. 


Combustion efficiency test. It has been noted that flash combustion may be inef- 
ficient in oxidizing organic carbon quantitatively to CO, such that any residual CO 
may interfere with N, in the mass spectrometer and increase the measured ratio of 
masses 29 to 28, leading to artificially higher 5'°N values***°. Furthermore, NH,~ 
bound to phyllosilicate minerals may not be quantitatively released from the sample**. 
In both cases VO; has been proposed as a combustion aid; however, it can contain 
traces of nitrogen itself that need to be subtracted. 

Ina previous study” using the same analytical protocol and facilities, we dem- 
onstrated that we could achieve good reproducibility in 8'°N (1o = 0.11%o) over a 
range of V,O;/sample ratios from 0 to 1.7 for a shale sample from the 1.5 Gyr Belt 
Supergroup. In addition, the measured total nitrogen concentrations showed no 
correlation with the V.O;/sample ratio (r? = 0.008) and hada high precision (rela- 
tive error 0.96%). Kerogen extraction and mass balance showed that 25% of all 
nitrogen in this sample was organic-bound (with 8'°N = +4.7 + 0.3%0) and 75% 
was silicate-bound (with 5'°N = +5.1 + 0.4%o). The good reproducibility of both 
isotopic ratio and total nitrogen abundance over a range of sample quantities from 
50 mg to 150 mg implies that even in the absence of V2Os the flash combustion tech- 
nique quantitatively oxidizes all organic- and silicate-bound nitrogen in this type 
of sample, where the majority of nitrogen is silicate-bound. Furthermore, the data 
suggest that CO interferences are insignificant for up to 1.66 mg organic carbon, 
which was the highest amount combusted with this sample (TOC = 1.1%, molar 
C/Npuk ratio = 29). 

In the course of this study, we conducted a second test with the UW-McRae 

standard (TOC = 7.7%, TN = 0.11%) to verify that this conclusion holds at high 
organic carbon abundances. We analysed aliquots of powder ranging from 1.6 mg 
to 61.2 mg, equivalent to 0.1-4.7 mg organic carbon, and achieved good reproduc- 
ibility for both 3!°N (1o = 0.19%o) and total nitrogen (relative error 5.8%), without 
addition of V2O; and at constant O, supply during flash combustion. If traces of 
CO were produced, it would be most significant at large sample sizes—that is, at 
small O,/sample ratios. However, the absence of any significant trend in the data 
(Extended Data Fig. 3) indicates that, up to at least 4.7 mg organic carbon, inter- 
ferences with CO can be disregarded. 
Elemental analyses. We submitted aliquots of powder to ALS for major, minor, 
and trace element analyses (ALS method ME-MS61). Samples were digested with HF, 
HNOs, and HCIO,, then analysed in dilute HC] with inductively coupled plasma- 
atomic emission spectrometry and inductively coupled plasma—mass spectrometry. 
We calculated enrichment factors (EF) for Mo and V (EF = ([X]/[Al]) sampte/({X]/ 
[Al]) references Where [X] is the concentration of either Mo or V) with reference to 
average upper continental crust”’ and average Archaean crust”* (Fig. 3 and Extended 
Data Fig. 2). 
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20, 249-255 (1993). 
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Extended Data Figure 1 | Organic carbon (a) and bulk nitrogen (b) isotopes as a function of total abundances. Correlation coefficients refer to data points and 
trend lines of the same colour. See Supplementary Information 3 for discussion. Each sample was analysed in two or three replicates. 
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Relative to avg. upper Archean crust 
Extended Data Figure 2 | Crust-normalized elemental abundances. 
a, Normalized relative to average upper continental crust; b, normalized relative 


because concentrations for those elements in average upper Archaean crust 
to average upper Archaean crust (Methods). Some data points in b are missing 


were not provided. In both parts, elements are sorted by relative abundance in 
upper continental crust. 
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Extended Data Figure 3 | Reproducibility of the UW-McRae in-house 
standard over a range of sample sizes. Filled diamonds, analysed with regular 
CN method; grey circles, analysed with a soda-lime column to scrub CO, from 
the gas stream; horizontal lines, total average (solid) + 1s.d. (dashed). 
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Extended Data Table 1 | Comparison of kerogen and whole-rock data for samples with sufficient material from the Soanesville Group and the 
Witwatersrand Supergroup 


8 Noulk TN = TOC C/Noue = 8° Nyer vd C/Nker F(Nor) fF (Ngi) 

Sample [%o] [ppm] [%] [atom.] [%o] [%o] [atom.] [%] [%] 
Soanesville Group: 
96049 0.48 359 = 1.20 39 -1.29 0.10 274 14 86 
96050 0.40 348 = 0.76 25 -0.99 0.07 252 10 90 
96051 0.42 4231.11 31 -1.63 0.10 278 11 89 
96052 0.40 325 0.85 30 -1.57 0.01 266 11 89 
96053 0.58 4051.23 35 -1.47 0.31 267 13 87 
96054 0.14 392 1.54 46 -2.08 0.02 282 16 84 
96055 0.64 503 1.98 46 -1.75 0.01 289 16 84 
96056 0.63 360 0.96 31 -2.12 0.16 287 11 89 
97SSD32-06 0.66 292 1.55 62 -1.40 0.01 297 21 79 
97SSD32-21 0.37 107. — 1.20 132 -2.24 0.23 302 43 57 
97SSD51-03 -0.86 371 =: 1.42 45 -2.25 0.06 267 17 83 
Witwatersrand Supergroup: 
BABI_1119 -0.59 37 0.16 49 -2.40 0.01 673 7 93 
BABI_115 3.64 135 0.47 41 -0.48 0.30 394 10 90 
TW6_1749.9 0.57 39 0.33 102 -2.17 0.72 1279 8 92 
TW6_1750.2 1.53 64 0.56 104 -1.79 0.05 1251 8 92 
TF1_4400 -1.17 74 1.13 171 -2.28 0.64 1910 9 91 
TF1_4440 -1.33 106 0.78 86 -4.44 0.28 1781 5 95 


Whole-rock data are repeated from Supplementary Table A2. 
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An epigenome-wide association study of total serum 
immunoglobulin E concentration 
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Aristea Binia’, Julian M. Hopkin‘, Ivana V. Yang’, Elin Grundberg®, Stephan Busche®, Marie Hudson’, Lars Ronnblom!’, 
Tomi M. Pastinen®", David A. Schwartz’, G. Mark Lathrop*8, Miriam F. Moffatt?8 & William O. C. M. Cookson?8 


Immunoglobulin E (IgE) is a central mediator of allergic (atopic) 
inflammation. Therapies directed against IgE can alleviate hay fever’ 
and allergic asthma’. Genetic association studies have not yet iden- 
tified novel therapeutic targets or pathways underlying IgE regula- 
tion® °. We therefore surveyed epigenetic associations between serum 
IgE concentrations and methylation at loci concentrated in CpG 
islands genome wide in 95 nuclear pedigrees, using DNA from peri- 
pheral blood leukocytes. We validated positive results in additional 
families and in subjects from the general population. Here we show 
replicated associations—with a meta-analysis false discovery rate 
less than 10° *—between IgE and low methylation at 36 loci. Genes 
annotated to these loci encode known eosinophil products, and also 
implicate phospholipid inflammatory mediators, specific transcrip- 
tion factors and mitochondrial proteins. We confirmed that methy- 
lation at these loci differed significantly in isolated eosinophils from 
subjects with and without asthma and high IgE levels. The top three 
loci accounted for 13% of IgE variation in the primary subject panel, 
explaining the tenfold higher variance found compared with that 
derived from large single-nucleotide polymorphism genome-wide 
association studies**. This study identifies novel therapeutic targets 
and biomarkers for patient stratification for allergic diseases. 
Asthma, atopic dermatitis (eczema) and hay fever are IgE-related 
diseases that are increasing in prevalence and are a major source of 
disability. Systematic knowledge of IgE production is limited, beyond 
the regulation of IgE creation in B cells by interleukin (IL)-4 released 
from T},2 cells and eosinophils’. Genome-wide association studies show 
polymorphisms in STAT6, FCERIA, the IL4/RAD50 locus and the 
major histocompatibility complex (MHC) to be associated with high 
IgE concentrations**, but these single-nucleotide polymorphisms (SNPs) 
combined account for only 1-2% of the variation in serum IgE’. 
CpG methylation is associated with gene silencing and the pattern- 
ing of gene expression that determines cellular types and functions'*, 


Table 1 | Subject characteristics 


and islands of CpG (CGIs) sequences are positioned near the promoters 
of 40% of human genes”. IL4 expression has been related to upstream 
epigenetic variation in DNA methylation in T cells'®, encouraging us 
to search genome wide for other CGIs associated with IgE serum 
concentrations. 

We used Illumina HumanMethylation27 arrays to target individual 
CpG sites (loci) within proximal promoter regions of 14,475 genes. 
The panel is enriched for genomic regions regulating expression, but 
does not cover all functionally important CpG sites. We excluded from 
downstream analyses any loci with SNPs overlapping the Illumina 
probe sequence, and established that direct bisulphite pyrosequencing 
correlates robustly with the array used in this study (Extended Data 
Fig. 1) and elsewhere". 

We investigated nuclear families from the Medical Research Council 
Asthma (MRCA) panel, in which we have previously carried out genome- 
wide SNP association studies for IgE levels and asthma’’. The panel 
contained 355 subjects (183 male) with a mean age in children of 12.2 
years (ranging from 2 to 39 years) and in adults of 42 years (27 to 61 years) 
(Table 1). One-hundred and thirteen children had doctor-diagnosed 
asthma. We found replication cohorts in 149 Caucasian subjects selected 
equally from the top and bottom deciles of IgE distribution in 1,614 
unselected volunteers for the Poblogaeth Asthma Prifysgol Abertawe 
(PAPA) study (students and staff from Swansea University)"; and in 
160 subjects in an asthmatic family panel from the Saguenay-Lac-Saint- 
Jean region (SLSJ) of Québec”, with a mean age in children of 16 years 
(ranging from 5 to 50 years; 40 with doctor-diagnosed asthma) and 
adults of 44 years (31 to 79 years) (Table 1). 

We fitted models with log.-normalized IgE (In(IgE)) as a dependent 
variable and methylation status for each Illumina probe as a predictor, 
with age, sex, parental status, interactions and batch identifiers as cov- 
ariates. We identified 34 loci with a false discovery rate (FDR) < 0.01 
(Fig. 1 and Supplementary Table 1) in 32 different CGIs in the MRCA 


MRCA (discovery) 
Number 355 
Age (mean (range)) 28 (2-61) 
N (%) female 17 (48.5%) 
N (%) asthmatic 175 (49.3%) 
N (%) smoker 45 (12.7%) 
Eosinophil count (mean + s.e.m.) per ul 406 + 383 


Geometric mean serum IgE (range) (IUI7!)+ 320 (1-4,999) 


s.e.m., standard error of the mean. 
* Normal range, <350 cells per ul. 
+ Normal range, <1001U! “1 


PAPA (1st replication) SLSJ (2nd replication) 


149 160 
21 (18-30) 29 (5-79) 

72 (48.3%) 80 (50.0%) 
34 (22.8%) 69 (43.1%) 
33 (22.1%) 28 (17.5%) 
246 +214 242 + 205 


663 (0-18,800) 412 (2-7,653) 
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Figure 1 | Manhattan plot of the results of the genome-wide methylation 
association study. a, b, The results of genome-wide association testing to CGIs 
are shown for 27,000 loci in 355 subjects from the MRCA panel of families. 
a, The QQ plot showing observed versus expected —log)9(P values) for 


panel. After replication in the PAPA and SLSJ panels, a meta-analysis 
combining the results identified 36 loci with FDR < 10 “ and 62 loci 
with an FDR < 0.005 (Table 2 and Supplementary Table 1). All loci showed. 
associations with the same anti-correlated direction in the three data 
sets (Table 2). A radioallergosorbent test (RAST) index, quantifying 
IgE against common allergens’’, showed similar but non-independent 
associations, suggesting common regulation of total and specific IgE. 
Testing of models with asthma as the dependent variable showed only 
LPCAT2 (also known as AYTL1) and ZNF22 to be associated with asthma 
independently of IgE levels (P = 7.7 X 10-° and P= 1.8 X 10“). 
The variable methylation site upstream of IL4 has a well-studied 
effect on IL-4 production’*” and IgE regulation, with methylation anti- 
correlated with expression in the same direction as in our study. We 
looked for SNP associations at this locus by imputation with the 1000 


T T ei eT a 
15 16 17 18 192021 X ¥. 


Chromosome 


association at all loci. b, Manhattan plot showing chromosomal locations of 
—logio(P values) for association at each locus. The red line illustrates the 
threshold for an FDR < 0.01. 


Genomes project phase 1 SNPs and indels in all three panels, analysing 
the 20,746 variants within 1 megabase upstream or downstream of the 
IL45’ untranslated region. We found no significant SNP associations 
with IgE after accounting for multiple testing. 

We carried out Mendelian randomization to test for a causal effect 
of IL-4 methylation on IgE"’, choosing the SNP showing the strongest 
association to methylation at the IL-4 CpG probe (cg26787239) as the 
instrumental variable. The first stage F-test statistics for the MRCA 
and SLSJ panels (F = 16.4 and 26.2) indicated effects strong enough to 
ensure the validity of the method. In the MRCA panel, the association 
between the instrument SNP (rs12311504) and IgE before adjusting 
for IL-4 methylation was P = 0.03, and P= 0.53 after adjustment, 
indicating that methylation mediated most of the SNP effect. The meta- 
analysis P for a causal effect was 6.8 X 10 *, suggesting that the locus 


Table 2 | Meta-analysis of association of total serum IgE concentration in three subject panels 


Probe Symbol Function MRCA (P value) SLSJ (P value) PAPA (P value) Meta-analysis (P value) 
cg01998785 LPCAT2 Lysophospholipid metabolism 1.2x10°8 8x10% 9.6 x 10°° 1.2 x 10°18 
cg10159529 IL5RA Cytokine signalling 5.1 x 10712 2.1x10% 7.2x10°° 2.2 x 10718 
cg01614759 ZNF22 Transcription factor 44x 10°12 3.4x10% 2.8 x10°° 2.8 x 10°18 
cg15996947 L2HGDH itochondrial oxidoreductase 74 x10°18 x10 3.7 x10°5 2.8% 10°77" 
cg26787239 IL4 Cytokine signalling 1.6x1071 3x10% 48 x104 3.2 x 10°16 
cg18783781 SLC25A33 Mitochondrial transport: dendritic-cell endocytosis 5.0 x10 3.4x107 6.2 x10% 44x1071!§ 
cg13221796 RB1 Transcription factor 5.7 x10°1° 6.0 x10? 49 x10°° 2.5x10°14 
cg01770400 SERPINC1 Anti-thrombin 6.6 x 10°14 7.0 x10°3 1.3% 10-2 53°x107*4 
cg02643667 TFF1 Mucus-stabilizing secreted protein 7.9 x 10°14 3x107 2.6 x 10-3 7.6 x10°18 
cg21627181 SLC17A4 Sodium/phosphate co-transporter 1.1 x 10° 4x10~* 3.8x104 1x10°' 
cg20189937 L2HGDH Mitochondrial oxidoreductase 1.3 x10°° 24x103 32x10°% 26x10" 
cg26457013 TMEM86B Lysoplasmalogenase: phospholipid metabolism 1.0 x 10° 9.5 x102 9.0 x 10-4 7.1x10°12 
cg20503329 COL15A1 Cell shape, motility, adhesion 1.2 x 10° 3.0 x107 88 x10°% 9.5 x10}? 
cg03693099 CEL Secreted carboxyl ester lipase 1.8 x 108 3¢ 1054 8.3 x10°% 3x101 
cg00079056 SPINK4 Serine peptidase inhibitor 1.5 x1077 64x10? 3.1 x 1075 8x 10744 
cg09676390 ADARB1 Pre-mRNA editing of the glutamate receptor 1.2 x10°8 7.0 x10 8.0 x 10-4 3.1x1071! 
cg15998761 MFSD6 MHC receptor homologue 9.3 x10°7 3x10? ‘71074 45x1071! 
cg25494227 TMEM52B Transmembrane protein 1.3 x1077 2.0 x 107! 210°? 5.110774 
cg11398517 FAM112A Unknown 2.4 x 10° 74 x10°3 3.3 x10"4 0 x10-1° 
cg06690548 SLC7A11 Cystine/glutamate antiporter: dendritic-cell 2.7.x 10-8 3.3 x104 1x10% 8 x10°1° 
differentiation 

cg17784922 KEL Metallo-endopeptidase 4.2 x107’ 7.9 x 10-3 44 x10°3 2.1 x1071° 
cg16050349 PIK3CB Catalytic subunit for PI3K-B: activation of neutrophils 4.0 x 10°° 1.7 x 10-3 2.3 x104 3.2 x 101° 
cg25636075 TMEM41A Transmembrane protein 2.5x104 5.1 ¥107° 7.7 x10~* 3.9 x 10°19 
cg08404225 IL5RA Cytokine signalling 2.3 x10+ 3.3 x10% 8.4x10°° 4.1 x10°1° 
cg09447105 PDE6H nhibitory subunit of CMP phosphodiesterase 22-107" 1.3 x1077 4.0 x 10-4 5.3 x10°1° 
cg05215575 SEPT12 Cell shape, motility, adhesion 3.11077 2.1 x107! 2.7 x10~* 2x10? 
cg26136776 KLF1 Erythroid-specific transcription factor 33x10 43x10} 6.6 x 10~* 5x10? 
cg17749520 ITGA2B Platelet fibronectin receptor: role in coagulation 1.4 x10°° 2.5 x10 3.5 x10 8x10? 
cg24459209 PRG3 Eosinophil major basic protein homologue 3.3 x 10° 2.9x10? 1x10% 8x10? 
cg00002426 SLMAP Sarcolemma-associated protein 7.9x107-° 8.3 x10% 6 x10~* 24x10? 
cg15357945 PRG2 Eosinophil granule major basic protein 2.2 x10°3 2.8 x10°° 5.8x10-% 3.1 x10°° 
cg17582777 EFNA3 Receptor protein-tyrosine kinase 1.1104 3.1 x 10-2 8.2 x10°% 8.6 x 10°? 
cg19881895 SLC43A3 Transmembrane protein 75x10 2.8x10% 6.7 x10 6x108 
cg 18254848 CLC Lysophospholipid metabolism 1.8x10-5 44 x102 4.6 x10°3 4.5 x108 
cg21631409 ALDH3B2 Enzyme or kinase 2.3 x10+ 1.7 x 10-2 2.x 107-5 68 x108 
cg00536175 GATA1 Eosinophil-specific transcription factor 79 x10°8 400x107! 51x10? A107 


Loci with an FDR for the meta-analysis <10° * are shown: a full list of significant associations can be found in Supplementary Table 1. MRCA contained 355 subjects, PAPA 149 and SLSJ 160. Markers are identified 
through their Illumina identification numbers and the associated gene symbol is derived from the Illumina annotation updated through PubMed. Note that two probes from /L5RA and from L2HGDH are associated 


to IgE concentrations. 
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represents a functionally validated epigenetic association with a com- 
plex phenotype. 

Several loci were annotated to genes that encode proteins characteristic 
of eosinophils (Table 2 and Supplementary Table 1). IL5RA encodes a 
receptor that selectively stimulates eosinophil production and activation”; 
CCR3 encodes the eosinophil eotaxin receptor; ILIRL1 encodes the recep- 
tor for the eosinophil-activating cytokine IL-33; PGR2 (also known 
as GPR142) encodes eosinophil granule major basic protein (PRG2); 
PGR3 (also known as GPR139) is a PGR2 homologue; and GATA1 is an 
eosinophil transcription factor. We therefore tested whether methyla- 
tion at our associated loci marked activation in eosinophils purified 
from peripheral blood, studying eight asthmatics with high serum IgE 
levels (>110 international units (IU)1~'), eight asthmatics with low 
serum IgE (<110IUI ') levels, and eight controls (mean age of all sub- 
jects 31 years (range 6-56 years), eight females and two current smo- 
kers). Asthmatics in both groups were on a maintenance regime of 
inhaled B-agonists, augmented with inhaled glucocorticoids during 
exacerbations. 

We observed the lowest levels of methylation in the subjects with 
asthma and high IgE, and that methylation in asthmatics with low IgE 
was intermediate to controls (Fig. 2) (P < 0.05; Supplementary Table 1), 
supporting our initial results. Partitioning the data into high or low IgE 
groups led to similar conclusions. The range of variation for the prin- 
cipal loci was narrower in asthmatics with high IgE (Fig. 2) than in the 
other two groups, suggesting the enrichment ofa distinctive eosinophil 
subset in atopic asthma. This is consistent with the recognized mixture 
of eosinophil populations in human blood and eosinophil activation in 
atopic disease”®. Comparison of methylation levels in the MRCA panel 


LPCAT2 (cg01998785) 


IL5RA (cg10159529) 


for our IgE-associated loci (P< 0.001) with an independent study of 
eosinophils isolated from normal subjects”' confirmed correlations with 
methylation status (R = 0.64). 

Lineage commitment to particular cell types is accompanied by spe- 
cific methylation changes’, and it has been suggested that DNA from 
mixtures of cells (such as peripheral blood leukocytes (PBLs)) will not 
support epigenome-wide association studies of complex diseases*’. We 
extracted DNA from unfractionated populations of PBLs so that our 
methylation patterns reflect the numbers and the activity of different 
cells in each specimen. We further explored whether our associations 
to IgE reflected carriage in particular cell types by fitting regression 
models that included differential white cell counts. We identified partial 
associations with eosinophil numbers for all IgE-associated loci (Sup- 
plementary Table 2 and Fig. 3), consistent with independent effects on 
IgE from the numbers of eosinophils and the activity of the loci within 
eosinophils. 

It is well recognized that the regulation of IgE production against 
particular antigens may reside in T cells and B cells as well as in eosino- 
phils”. However, our regression models found that the top IgE associa- 
tions were not accounted for by concomitant correlation with lymphocyte 
counts (Supplementary Table 2). We further examined the distribution 
of IgE-associated CpG loci in leukocyte subsets we isolated and in 
subsets from published data”. The results showed robustly in both 
data sets that low levels of methylation at the IgE-associated loci were 
confined to eosinophils (Extended Data Fig. 2). 

Surrogate CpG markers that identify lymphocyte subsets can be used 
as an alternative to white cell counts in association models”. We also 
applied these methods to our data (Extended Data Table 1). This analysis 
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Figure 2 | Boxplots of methylation at selected CpG loci in isolated 
eosinophils from subjects with and without asthma and high total serum IgE 
concentrations (>110IUI™'). Subjects were recruited from the SLSJ 
population. Methylation (/) is shown ona scale of 0-1 for eight subjects in each 
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group. Boxplots show means and interquartile ranges. The intensity of the data 
point colour is proportional to total serum IgE. All loci exhibited reduced 
variability and levels of methylation in the subjects with asthma and high IgE 
(P<0.05). 
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Figure 3 | Association of selected CpG loci to total serum IgE 
concentrations in the MRCA panel, partitioned by eosinophil counts. 
Methylation values for 355 individuals normalized around a mean of 0 on the 


provided further evidence that T-cell subsets do not have strong effects 
on these loci. 

The variance (standard deviation) in our IgE-associated CpG loci 
was on average 4.4-fold larger in isolated eosinophils than in PBLs from 
the MRCA data set, indicating an attenuation of effect size in PBLs that 
would mask associations rather than magnify them. The power to detect 
cell-specific associations from PBLs depends on the proportion of each 
cell type, the effect size in specific cells, and the sample size. We esti- 
mated that we had 90% power to detect loci accounting for 10% of 
variance in IgE in the MRCA panel and >99% power in the combined 
panels (Extended Data Fig. 3). Although our ability to detect associa- 
tions was enhanced by eosinophil counts that were above the normal 
range for many of our subjects (Table 1), the power estimates and the 
large observed effect size encourage epigenome-wide association stud- 
ies of other diseases in which PBLs may be important. 

We investigated the variance attributable to different loci in the MRCA 
panel through a stepwise regression that included all significant CGI 
associations together with differential white cell counts, age, sex and 
parental status. Using a cut-off of corrected P< 0.1 for inclusion, we 
found that SLC25A33, LPCAT2 and L2HGDH predicted the serum IgE 
concentration independently of each other and of eosinophil counts. 

In the MRCA panel, the top three CpGs independently explained 
13.5% of IgE variation and the eosinophil counts explained a further 
8.8%. In the SLSJ panel the top three CpGs explained 8.3% IgE variation 
and eosinophil counts a further 15.5%. (We were not able to estimate 
variances meaningfully in the PAPA data set, because the samples were 
selected by extreme IgE values). The regression models therefore matched 
the results from isolated eosinophils, leading us to the conclusion that 
the methylation status of eosinophils and their numbers were both 
related to IgE levels. 

Methylation levels were highly correlated between loci, and similar 
estimations of variance were obtained with forced entry of other signi- 
ficantly associated markers, so the results do not imply that SLC25A33, 
LPCAT2 and L2HGDH are the most important loci. DNA methylation 
is not meiotically heritable and the variance in IgE attributable to these 
loci does not impact on the problem of missing heritability. 

Overall, the most significant association was to cg01998785, within a 
CGI adjacent to LPCAT2. LPCAT2 encodes lyso-platelet-activating 
factor (PAF) acetyltransferase, which is essential to induced formation 
of PAF, a potent pro-inflammatory lipid mediator™. It is of interest 
that hypoactive variants of plasmatic PAF-acetylhydrolase are assoc- 
iated with atopy and asthma”. Other significant associations annotated 
to genes involved in phospholipid metabolism included lysoplasmalo- 
genase (TMEM86B), CEL (carboxyl ester lipase) and CLC. 

GATA1 is a known eosinophil transcription factor and subsequent 
investigations will determine if the other associated transcription fac- 
tors ZNF22, RB1 and KLFI regulate eosinophil activation. Other associa- 
tions may encode proteins released from eosinophil granules, including 


abscissa (x) with In(IgE) levels on the ordinate. Blue dots indicate subjects with 
eosinophil counts greater than the median for the MRCA panel. 


PRG2, PRG3, SERPINCI (antithrombin), TFF1 (which may protect the 
mucosa), CEL and the polyvalent serine protease inhibitor SPINK4. 
Genes encoding mitochondrial proteins (L2HGDH and SLC25A3) are 
consistent with mitochondrial suppression of apoptosis in activated 
eosinophils”. 

Although the relationship between methylation and gene expression 
at these loci requires further investigation in isolated cells, our results 
support the recognition that eosinophils are an important source of 
cytokines and other pro-inflammatory molecules at the site of allergic 
inflammation’. Eosinophils are required locally for the maintenance of 
bone-marrow plasma cells”, allowing direct regulation of IgE produc- 
tion in specialized environments. Clinically, the presence of eosinophi- 
lia in the peripheral blood or airways identifies a subgroup of refractory 
asthmatic individuals in whom therapies directed at eosinophils may 
be effective”. The measurement of methylation at these loci may fur- 
ther identify patients responsive to therapies directed at eosinophils or 
individual gene products. 

Cigarette smoking may increase serum IgE, and we found anti-correlated 
associations to current cigarette smoking with F2RL3 (P = 8.6 X 10") 
and GPRI5 (P=4.6X 10 °). The SLSJ data set confirmed these 
associations (P=2.5 X10 ° and P=6.6 X10’), in keeping with 
previous studies”. Adjusting for smoking had minimal impact on 
the top hits for IgE and neither locus affected IgE in our subjects. F2RL3 
and GPRI5 may represent therapeutic targets to counter the effects of 
tobacco smoke and their methylation status may prospectively predict 
consequences of smoking. 

Our epigenome-wide association study has discovered reproducible 
CGI associations accounting for a variation in the total serum IgE that 
is tenfold higher than that derived from large SNP genome-wide asso- 
ciation studies’. In contrast to SNP studies, association to methylation 
levels captures responses to environmental factors and the loci should 
not be assumed to cause disease. Nevertheless, our findings suggest the 
presence of novel therapeutically tractable pathways underlying IgE 
production. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Phenotyping. Ethical approval for the study was obtained from the NHS Multicentre 
Research Ethics Committee for the MRCA subjects; from the Swansea Joint Scientific 
Research Committee and Swansea Research Ethics Committee for the Swansea 
(PAPA) subjects; and from le Centre de Santé et des Services Sociaux de Chi- 
coutimi for the SLSJ families. Written informed consent was obtained from all 
subjects or in the case of children, from their parents. Asthma was doctor defined. 
Following a standard respiratory questionnaire, all subjects submitted to venipunc- 
ture. Differential white cell counts were measured by automated counter. Total 
serum IgE and specific serum IgE to whole HDM (Dermatophagoides pteronyssinus) 
and Timothy grass pollen (Phleum pratense) were measured using the Immunocap 
FEIA (Pharmacia AB). The levels of specific IgE were converted to RAST units 
according to Pharmacia recommendations. A combined RAST index was calcu- 
lated for each individual as the sum of the RAST scores to HDM and Timothy 
grass’. 

Detection of methylation status. DNA was extracted after red cell lysis and 
centrifugation to recover leukocyte nuclear pellets. DNA samples were bisulphite 
converted using the Zymo EZ DNA Methylation kit (Zymo Research) with an 
input of 1,000 ng. The assay was carried out as per the Illumina Infinitum Methylation 
instructions, using the HumanMethylation27 BeadChips (Illumina). These inter- 
rogate 27,578 of CpG sites for the extent of DNA methylation. Data were visualized 
using the BeadStudio software, and samples that failed quality control were repeated. 
Raw methylation data were exported from the GenomeStudio software. For the 
Illumina HumanMethylation27 BeadChip data, quantile normalization of intens- 
ity was applied to all methylated and unmethylated probes for all samples together. 
The methylation f values were recalculated as the ratio of methylated probe signal/ 
(total signal + 100). The Touleimat and Tost” analysis pipeline was used for the 
HumanMethylation450 BeadChip. Individual data points with detection P > 0.01 
or number of beads <3 were treated as missing data, as were samples with more 
than 20% missing probes. The lumi package” was used for background and colour 
bias correction. BeadChip ID and position on chip were included as categorical 
covariates to account for potential batch effects. Quantile normalization across 
samples was applied to probes within each functional category (CpG island, shelf, 
shore, and so on) separately to correct the shift of methylation f values between 
Infinium I and Infinium II probes on the HumanMethylation450 BeadChip. Probe 
overlaps with any frequent SNP (minimum allele frequency (MAF) > 5% in 1000 
Genomes Project phase 1 EUR population) in the probe sequence or in position 
+1 or +2 of the query site (depending on Infinium I or Infinium II status) were 
removed. The use of meta-analysis to combine 27K and 450K data together with 
this implementation of the Tost pipeline ensured our analysis was not confounded 
by probe differences. 

Isolation of human eosinophils. Isolation of human eosinophils was as described*’. 
Briefly, platelet-rich plasma was removed from 200 ml using centrifugation, fol- 
lowed by Dextran-mediated sedimentation to remove erythrocytes and removal of 
mononuclear cells using a lymphocyte separation medium. Hypotonic lysis with 
sterile water removed remaining erythrocytes and other granulocytes were removed 
using negative selection with anti-CD16 MicroBeads. DNA was extracted using the 
QlIAamp DNA Blood Mini Kit. Methylation was assessed using Illumina 450K arrays, 
with analysis restricted to significantly associated probes from the meta-analysis. 
Statistical analyses. To investigate the association with the total serum IgE con- 
centration we tested for association with log-normalized IgE (In(IgE)) as response 
with methylation (f) at each locus as predictor while including sex, age, parent 
indicator, age X sex and age X parent interactions in the model, together with 
batch indicators captured by Illumina chip ID and position of chip (such as 
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operators, sample wells, plates, runs, and reagents). We applied inverse normal 
transformation to methylation measures to remove the effect of outliers. We used 
the R function Ime() in the nlme package to implement a linear mixed model, 
assuming a compound symmetry variance-covariance structure to account for 
correlation of phenotypes among family members. The R code for the discovery 
stage of association in the MRCA panel was: 


index = !is.na(methylation) 

fam = familyID[index] 

par = parent[index] 

methylation = methylation [index] 

methylation = qnorm(rank(methylation)/(length(methylation)+ 1),mean = 0,sd = 1) 
Inige = LNIGE[index] 

age = AGE[index] 

sex = SEX[index] 

lm2 = Ime(Inige~sex+age+ methylation+ par+ sex*age+ age*par,random = ~ 1 


|fam) 


The residual methylation values after removal of the effects of chip ID and 
position for the genome-wide significant loci in the MRCA, PAPA and SLSJ panels 
are provided in Supplementary Tables 3-5, together with phenotypic and covari- 
ate parameters. We calculated FDRs and applied Bonferroni corrections to adjust 
for multiple comparisons to 27,578 probes. The same analyses were carried out in 
the PAPA and SLSJ subjects before meta-analysis of the three data sets. We use a 
weighted z-score method for meta-analysis based on P value and effect direction 
from individual studies with weights proportion to the square root of the sample 
size of the individual study**. SNPs and indels from the 1000 Genomes Project 
phase 1 release (14 March 2012 haplotypes) were imputed using MINIMAC*. 
SNPs or indels with imputation quality score R’ < 0.3 were removed from down- 
stream analysis. We carried out Mendelian randomization to assess the causal 
effect of IL-4 methylation on IgE level through a two-stage least squares instrumen- 
tal variable regression*® implemented in the ivreg2.r program (http://diffuseprior. 
wordpress.com/2012/05/03/an-ivreg2-function-for-r/). We tested association 
trends in isolated eosinophils by exact regression (Cytel Studio 9) with asthma/ 
high IgE coded as 2, asthma/low IgE coded as 1, and controls as 0. Covariates for 
age, sex and batch were included in the model and to test the hypothesis that low 
levels of methylation were associated with high IgE, P values were one-sided. Dif- 
ferences in methylation between peripheral blood leukocyte subsets were assessed 
with Kruskal-Wallis tests, using two-sided P values. 
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Extended Data Figure 1 | Concordance in methylation status at IgE- 
associated loci when comparing whole-genome bisulphite sequencing with 
the Illumina platform. These results were produced by us (E.G. and T.M.P.) at 
the Genome Québec Innovation Centre. The figures show a comparison 
between IgE-associated CpG probes using Illumina 450K (x-axis) and 
whole-genome bisulphite sequencing (WGBS) (y-axis) platforms for two 
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samples (left and right panels) with 20-fold sequence coverage. The results 
show a high R* between platforms (0.76 and 0.73). The median of the 
correlation coefficients for our IgE-associated loci across 30 different samples 
(using WGBS at various depths) was R° = 0.76. This can be compared with the 
global assessment of all overlapping 450K sites, which is R” = 0.81. 
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Extended Data Figure 2 | Distribution of methylation status at IgE- (CD56+NK); CD8* T cells (CD8+T); granulocytes (Gran); neutrophils (Neu) 
associated loci in isolated leukocyte subsets. a-I, The figure shows the and peripheral blood mononuclear cells (PBMCs). b, d, f, h, j, 1, Results from 


distribution of methylation in PBL subsets at the most strongly IgE-associated _ cells isolated and analysed by us at the McGill University Genome Québec 
loci. CpG methylation was measured by the Illumina Infinium 450K platform. Innovation Centre (MUGQIC). Eosinophils (Eos) (from 24 subjects in the SLSJ 
Boxplots show means and interquartile ranges. a, c, e, g, i, k, Results from panel) also show lower levels of methylation with wider variation compared 
publically available data derived from six healthy controls”!. Lower levels of to whole blood (22 SLSJ subjects), and to subsets including B cells (BC; 9 control 
methylation with wider variation are observed in eosinophils when compared _ subjects), monocytes (Mono; 76 control subjects) and T cells (TC; 74 control 
to whole blood (WB) and subsets comprising CD14* monocytes (CD14+M); _ subjects). KW Test, Kruskal-Wallis one-way analysis of variance. 

CD19* B cells (CD19+B); CD4* T cells (CD4+T); CD56" natural killer cells 
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Extended Data Figure 3 | Power estimations to detect eosinophil-specific 
effects in DNA from PBL. The figure shows that our original MRCA data set 
(green line) and our combined data set (blue line) are well powered to detect 
signals of the magnitude observed in our three groups of subjects. The red line 
shows the power of a sample size of six, as described previously”’, to detect 


differences in CpG methylation in unfractionated PBLs. The mean variance (as 
standard deviation (s.d.)) for the IgE-associated loci was 0.036 in PBLs from our 
primary MRCA panel and 0.023 in the whole blood normal samples from 
ref. 21, demonstrating that our results were consistent with the previous 
experiment. 
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Extended Data Table 1 | Comparison of surrogate variable analyses with direct white cell counts in association models 


a) Before adjusting cell counts b) Adjusting for Houseman cell proportions c) Adjusting for white cell counts 
Chr. Position Symbol probe _pMethy  ‘pMethy pCD81 pCD4T pNK  pBCELL pMON pGRAN  ‘pMethy pEOS pNEU | pLYM pMON  pBAS 
TT 9521654 SLOZOA33 Cg18/8a/81  4.90E-14  5.58E-13 0.3020663 0.646914 0,0630001 0.0730391 0.77 15233 0.2564409  6.14E-06 0.00005 0.800001 0.676606 0.561016 O.c01/o2, 
16 54100210 LPCAT2 eg01998785 =. 1.24E-13 3.81E-13 0.1560292 0.3633042 0.034881 0.0300831 0.7239839 0.1881862 2.36E-06 0.0003 0.294806 0.300248 0.445243 0.785135 
14 49849865 L2HGDH ~—cg15996947 = 7.39E-13 3.81E-11 0.3709841 0.6823007 0.125117 0.1583239 0.9387673 0.3768824 3.21E-05 0.000295 0.38062 0.519495 0.662394 0.890493 
10 44815441 ZNF22 eg01614759 =4.42E-12 9.09E-11 0.3526504 0.6420391 0.1382946 0.0810812 0.654088 0.3845248 7.61E-05 0.000216 0.305013 0.42021 0.601202 0.937252 


3 3127530 IL5RA eg10159529 5.06E-12 1.70E-10 0.8786628 0.9388517 0.2941882 0.2701973 0.9752202 0.5129874 0.000104 6.81E-05 0.877227 0.802458 0.551635 0.756105 
1 172153108 SERPINC1 cg01770400 6.59E-12 1.54E-11 0.1182883 0.3063038 0.0323251 0.0368702 0.574556 0.1896 6.36E-06 0.000109 0.236406 0.186757 0.667289 0.90807 
21 42659768 TFF1 cg02643667 7.87E-12 6.81E-11 0.234861 0.4646176 0.0330799 0.0706689 0.797703 0.1753888 9.07E-06 7.18E-06 0.384611 0.725574 0.672431 0.976409 
5 132036424 iL4 cg26787239 = 1.59E-11 2.13E-10 0.5058003 0.9922092 0.1723318 0.1996409 0.7364825 0.6544331 1.25E-05 1.66E-05 0.269226 0.259295 0.614387 0.77939 
2 110014086 LIMS3 cg18879041 4.06E-11 2.57E-09 0.3708111 0.7030526 0.1613161 0.2166898 0.9745824 0.474026 0.000865 0.000256 0.280498 0.382954 0.500203 0.963467 
13 47774920 RBI cg13221796 5.66E-10 9.26E-11 0.1278218 0.3641314 0.0353766 0.0458363 0.7786051 0.2987393 2.84E-06 1.83E-05 0.029037 0.081617 0.516731 0.982861 


19 60432000 TMEM86B cg26457013 = 1.02E-09 4.08E-09 0.3305229 0.5920938 0.036091 0.0792481 0.9078355 0.2711535 0.000691 1.35E-05 0.363948 0.317448 0.558421 0.740734 
9 100745613 COLT5A1 cg20503329 = 1.19E-09 3.14E-08 0.4772819 0.7388277 0.1359031 0.1343368 0.9433195 0.4284566 0.001341 1.17E-05 0.443838 0.49736 0.401174 0.959364 


1 31166502 SDC3 cg07689731 2.33E-09 4.79E-09 0.5074834 0.7985815 0.0774502 0.0726484 0.7924958 0.4108301 0.002906 1.48E-05 0.415182 0.483806 0.437372 0.932638 
21 45317773 ADARB1 cg09676390 1.21E-08 1.32E-07 0.43369 0.6702876 0.0791369 0.1103603 0.9234331 0.3292778 0.000472 3.48E-07 0.432091 0.530921 0.462608 0.820656 
9 134926722 CEL cg03693099 1.84E-08 1.01E-07 0.2926779 0.512945 0.0818301 0.1071869 0.9754856 0.3848947 0.000383 9.27E-06 0.099207 0.18224 0.575481 0.93957 
19 12859426 KLF4 ¢g26136776 3.34E-08 5.42E-07 0.2328607 0.4764967 0.0498737 0.087521 0.7661611 0.2312155 0.008494 4.25E-06 0.308908 0.378613 0.526765 0.764392 


x 48529968 GATAI cg00536175 7.85E-08 5.70E-07 0.6056632 0.9159691 0.163687 0.1102612 0.8082841 0.5675393 0.002444 1.31E-06 0.230592 0.333849 0.423511 0.745501 
12 10222881 TMEM52B cg25494227 = 1.29E-07 2.79E-07 0.4810187 0.6746518 0.0457622 0.0735827 0.9017706 0.3386288 8.31E-05 2.05E-08 0.349366 0.362883 0.456694 0.75634 


9 33229641 SPINK4 cg00079056 1.54E-07 1.23E-06 0.3893041 0.6595419 0.067607 0.090288 0.950103 0.3262354 0.015058 9.70E-07 0.327361 0.481372 0.49343 0.779557 
12 15017287 PDE6H cg09447105 2.16E-07 1.80E-06 0.6038815 0.9752817 0.1325361 0.2619219 0.6725442 0.6818338 0.002327 6.73E-07 0.185319 0.308805 0.650588 0.72691 
16 4778723 FLJ25410 cg05215575 3.11E-07 7.40E-08 0.6354446 0.6693211 0.0382164 0.0635217 0.8705757 0.2084314 0.011151 9.08E-08 0.699883 0.891282 0.341595 0.856081 
7 142369547 KEL cg17784922 =4.24E-07 9.07E-07 0.6926023 0.9854571 0.0699431 0.1577229 0.8028496 0.5052827 0.001391 3.21E-08 0.505128 0.698535 0.367309 0.99912 
7 29199993 CHN2 cg21472642 6.51E-07 7.39E-07 0.5225896 0.8312874 0.0708089 0.0764603 0.9022296 0.4725488 8.45E-06 3.38E-09 0.104506 0.231601 0.595919 0.815002 
2 191009029 FLJ20160 cg15998761 9.33E-07 9.09E-06 0.4236314 0.824669 0.1393183 0.1145142 0.8527446 0.4643437 0.043732 5.71E-07 0.331558 0.449501 0.418024 0.785659 
6 25862169 SLC17A4 cg21627181 = 1.14E-06 1.42E-06 0.7663137 0.9149808 0.0591244 0.1414494 0.9511792 0.3972344 0.025202 5.30E-08 0.609342 0.865393 0.467443 0.746032 


14 49849874 L2HGDH ~—cg20189937 =: 11.32E-06 1.16E-05 0.503935 0.9204654 0.0921306 0.1950088 0.8817908 0.5174749 0.007269 1.26E-07 0.297519 0.525586 0.480584 0.958405 
17 39822093 /TGA2B cg17749520 = 1.37E-06 9.24E-06 0.3677214 0.6752543 0.046072 0.1157269 0.8717913 0.3274826 0.009625 3.11E-07 0.286277 0.406727 0.666483 0.903995 
17 39701014 SLC4A1 cg03580247 = 1.51E-06 1.76E-05 0.4966594 0.7030166 0.1566437 0.1201467 0.7706213 0.4809553 9.19E-05 7.97E-09 0.262889 0.282218 0.522557 0.944193 
20 41789039 FAMT12A cg11398517 = 2.42E-06 2.11E-05 0.431888 0.7316467 0.0506171 0.209415 0.8337169 0.4256838 0.006763 1.86E-07 0.220078 0.444385 0.592602 0.918166 
11 56904791 PRG3 cg24459209 3.33E-06 1.29E-05 0.5460024 0.9366639 0.0692143 0.1545826 0.8947187 0.4338765 0.094895 2.69E-07 0.444762 0.62114 0.500854 0.993962 
12 7792928 CLEC4C =. cg22194129 5.46E-06 0.0001208 0.4977092 0.8378959 0.0710935 0.6019842 0.8886589 0.4447889 0.023532 2.38E-08 0.408481 0.759002 0.521453 0.818552 
14 95011802 Cf4orf49 = cg16522484 §=8.04E-06 0.0001668 0.4308412 0.7282314 0.1486239 0.1112817 0.9482318 0.4206034 0.034072 1.01E-07 0.285777 0.449644 0.49125 0.837548 
x 48529554 GATAI 0922543648 9.51E-06 7.12E-05 0.682124 0.9511741 0.1312401 0.1917101 0.6997158 0.4748147 0.02664 2.00E-08 0.585973 0.689124 0.425854 0.888172 


2 11727816 NTSR2 cg25657834 1.02E-05 2.04E-05 0.9354409 0.6331433 0.1373482 0.3261244 0.4920064 0.8015518 5.67E-05 4.84E-10 0.191813 0.526463 0.689339 0.954103 
19 48912054 IRGC cg26251865 1.76E-05 9.19E-06 0.714389 0.9562959 0.0555475 0.1095196 0.6922884 0.4367507 0.011533 1.35E-08 0.495745 0.567965 0.468957 0.977824 
19 44919789 CLC cg18254848 = 1.77E-05 2.33E-05 0.7122613 0.874791 0.0834931 0.1954486 0.6341788 0.5277066 0.161753 4.77E-08 0.462599 0.67872 0.424877 0.937408 
6 10635801 GCNT2 cg26385286 =: 1.78E-05 3.88E-05 0.5747779 0.9758809 0.0654066 0.2046998 0.6574459 0.5308099 0.070861 4.31E-08 0.339931 0.593173 0.451377 0.902264 
3 150064527 CPA3 cg13424229 —1.89E-05 2.29E-06 0.4329161 0.6962458 0.1048729 0.0674688 0.7084296 0.6214492 0.001236 4.96E-07 0.054043 0.094343 0.451245 0.955257 


a-c, The table shows the P values from regression models in the MRCA panel that predict In(IgE) at each locus before adjusting cell counts (a); after adjusting cell counts using Houseman surrogate variables (b); 
and after adjusting for cell subsets counted in our data (c). pMethyl measures the strength of association to IgE. CD8, CD4 and NK are T-cell subsets. All models are adjusted for sex, age, methylation, parent/child 
status, sex X age interaction, and age X parent interaction. BAS, basophils; EOS, eosinophils; GRAN, granulocytes; LYM, lymphocytes; MON, monocytes; NEU, neutrophil. 
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The ability to differentiate stimuli predicting positive or negative 
outcomes is critical for survival, and perturbations of emotional 
processing underlie many psychiatric disease states. Synaptic plas- 
ticity in the basolateral amygdala complex (BLA) mediates the 
acquisition of associative memories, both positive’ and negative*’. 
Different populations of BLA neurons may encode fearful or 
rewarding associations*°, but the identifying features of these 
populations and the synaptic mechanisms of differentiating positive 
and negative emotional valence have remained unknown. Here we 
show that BLA neurons projecting to the nucleus accumbens (NAc 
projectors) or the centromedial amygdala (CeM projectors) undergo 
opposing synaptic changes following fear or reward conditioning. 
We find that photostimulation of NAc projectors supports positive 
reinforcement while photostimulation of CeM projectors mediates 
negative reinforcement. Photoinhibition of CeM projectors impairs 
fear conditioning and enhances reward conditioning. We character- 
ize these functionally distinct neuronal populations by comparing 
their electrophysiological, morphological and genetic features. 
Overall, we provide a mechanistic explanation for the representation 
of positive and negative associations within the amygdala. 

The BLA, including lateral and basal nuclei of the amygdala", 
receives sensory information from multiple modalities’*“*, and 
encodes motivationally relevant stimuli'*”’. Partially non-overlapping 
populations of BLA neurons encode cues associated with appetitive or 
aversive outcomes*”. The acquisition of the association between a 
neutral stimulus and an aversive outcome such as a foot shock has 
been shown to induce long term potentiation (LTP) of synapses onto 
lateral amygdala neurons**, mediated by postsynaptic increases in 
a-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor 
(AMPAR)-mediated currents*’* in a N-methyl-D-aspartate receptor 
(NMDAR)-dependent manner”. Similarly, increases in glutamater- 
gic synaptic strength of inputs to BLA neurons are necessary for the 
formation of a stimulus-reward association’. Yet the similarity in 
neural encoding and synaptic changes induced by learning a positive 
or negative association and the contrasting nature of the ensuing out- 
puts (reward-seeking or fear-related behaviours) presents an ostens- 
ible paradox: how is it possible that potentiation of synapses onto 
neurons in the BLA can underlie learned associations that lead to such 
different behavioural responses? 

One hypothesis is that BLA neurons project to many downstream 
regions, including the canonical circuits for reward and fear'**°, and 
the neurons that project to different targets undergo distinct synaptic 
changes with positive or negative associative learning. For example, 
BLA projections to the NAc have been implicated in reward-related 
behaviours'*”!”, while BLA projections to the CeM have been linked 
to the expression of conditioned fear***°. However, the unique syn- 


aptic changes onto projection-identified BLA neurons have never 
been explored. 

To investigate this, we selected the NAc and CeM as candidate target 
regions and examined the synaptic changes onto either NAc-project- 
ing BLA neurons (NAc projectors) or CeM-projecting BLA neurons 
(CeM projectors) following fear conditioning or reward conditioning 
(Fig. 1). To identify the projection target of BLA neurons, we injected 
retrogradely travelling fluorescent beads (retrobeads) into either the 
NAc or CeM to label BLA neurons projecting axon terminals to these 
regions (Fig. la and Extended Data Fig. 1). After retrobead migration 
upstream to BLA cell bodies, we trained mice in fear or reward con- 
ditioning paradigms wherein a tone was paired with either a foot shock 
or sucrose delivery. Mice in reward conditioning groups were food 
restricted 1 day before the conditioning session to increase motivation 
to seek sucrose (Extended Data Fig. 1). AMPAR/NMDAR ratio, a 
proxy for glutamatergic synaptic strength, increases after either fear 
or reward conditioning in the BLA'’*'*®. We used matched experi- 
mental parameters across groups in an acute slice preparation stimu- 
lating axons arriving via the internal capsule and performing whole- 
cell patch-clamp recordings in retrobead-identified NAc projectors 
and CeM projectors, which we observed to be topographically inter- 
mingled (Fig. 1b and Extended Data Fig. 2). 

We found that in NAc projectors, fear conditioning decreased the 
AMPAR/NMDAR ratio relative to controls exposed to the same num- 
ber of tones and shocks, but where the tones and shocks were unpaired 
(Fig. 1c). Conversely, following the acquisition of the association 
between a tone and sucrose delivery, synapses on NAc projectors 
showed an increase in AMPAR/NMDAR ratio relative to unpaired 
controls that were also food restricted and received the same number 
of tones and volume of sucrose (Fig. 1d). Importantly, we also included 
naive and food-restricted naive groups (Fig. 1c—f), as food restriction 
itself could alter AMPAR/NMDAR ratio (Extended Data Fig. 3). 

In contrast, synapses on CeM projectors from the paired group 
showed an increase in AMPAR/NMDAR ratio following fear condi- 
tioning, relative to unpaired controls (Fig. le). Following reward con- 
ditioning, CeM projectors from mice that learned the tone-sucrose 
association showed a decrease in AMPAR/NMDAR ratio relative to 
unpaired controls (Fig. 1f). In addition to AMPAR/NMDAR ratios, we 
also examined paired-pulse ratios, and did not detect any differences 
between groups (Extended Data Fig. 3), suggesting a postsynaptic 
mechanism of plasticity. 

These results support a model wherein NAc and CeM projectors 
undergo opposing changes in synaptic strength following fear and 
reward learning, such that relative synaptic strengths onto CeM pro- 
jectors increase following fear conditioning and decrease following 
reward learning. Conversely, relative synaptic strengths onto NAc 
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Figure 1 | Opposite changes in AMPAR/NMDAR following fear or 
reward conditioning in BLA neurons projecting to NAc or CeM. a, After 
injecting retrobeads into NAc or CeM, animals underwent either fear or 
reward conditioning. b, Confocal image of retrobead-labelled BLA neurons, 
with schematic of stimulation and recording sites (left); region in white 
square is enlarged (right). DAPI is shown in blue. c-f, One-way ANOVAs 
were performed on AMPAR/NMDAR ratios collected after conditioning. 
Open circles reflect individual data points, number of neurons are shown in 
each bar and representative traces for each group are below the bar. Results 
show mean and s.e.m. FR, food restricted. c, AMPAR/NMDAR ratio was 
related to training condition (F233 = 5.844, P = 0.0070) and significantly 
lower in the paired fear group relative to the unpaired fear group (t3; = 2.21, 
*P < 0.05). d. AMPAR/NMDAR ratio was related to training condition 
(Fx3, = 6.53, P = 0.0046) and reward learners showed a greater 
AMPAR/NMDAR ratio than mice in the unpaired reward group (tg = 
3.20, **P < 0.01). e, In CeM projectors, AMPAR/NMDAR ratio was 
related to fear conditioning (F229 = 8.72, P = 0.0012) and was greater in 
the paired group relative to the unpaired group (t27 = 3.99, ***P < 0.001). 
f, In CeM projectors, AMPAR/NMDAR ratio was altered by reward 
learning (F,32 = 3.63, P = 0.039), and was lower in learners relative to 
the unpaired group (tz) = 2.57, *P < 0.05). g, Proposed model, arrow 
thickness represents relative synaptic strength. CeM pr., CeM projectors; 
NAc pr., NAc projectors. 
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Figure 2 | Within the BLA, photostimulation of NAc or CeM projectors 
causes positive or negative reinforcement, respectively. a, After rabies virus 
injection into NAc or CeM, animals were tested using intracranial self- 
stimulation (ICSS) and real-time place avoidance (RTPA) assays. 

b, Representative traces of nose poke responses during ICSS. c, The relative 
number of active nose pokes was related to the experimental group (one-way 
ANOVA, Fg = 10.50, P = 0.0012), and was significantly increased by 
photostimuation of NAc projectors (NAc pr.) in comparison to controls (tis = 
4.00, **P < 0.01). CeM pr., CeM projectors. d, Representative locomotor trace 
from an animal receiving CeM projector photostimulation during RTPA. 

e, The percentage of time spent in the photostimulation-paired zone was related 
to the experimental group (one-way ANOVA, F)45 = 4.38, P = 0.019) and was 
significantly decreased by photostimulation of CeM projectors in comparison 
to controls (t43 = 2.25, *P < 0.05). Results show mean and s.e.m. 


projectors decrease following fear conditioning and increase following 
reward learning (Fig. 1g). However, these findings raise new questions. 
First, is there a causal relationship between the activity of BLA neurons 
projecting to the NAc and reward-related behaviours, and between the 
activity of CeM projectors and fearful or aversive behaviours? Second, 
what defining features of NAc and CeM projectors might endow them 
with their opposing functions? 

To test whether there is a causal relationship between populations of 
projection-identified BLA neurons and behaviour, we first used a ret- 
rogradely infectious rabies viral (RV) vector’® to express channelrho- 
dopsin-2 (ChR2) fused to a fluorescent reporter (Venus), or a control 
virus carrying Venus alone (RV-Venus) in BLA neurons projecting to 
either the NAc or the CeM (Fig. 2a). 

Following verification of functional ChR2 expression and retrograde 
transport from either the NAc or CeM back to the BLA (Extended Data 
Fig. 7), we tested animals receiving injections of RV-ChR2-Venus or 
RV-Venus into either the NAc or CeM and implantation of an optical 
fibre over the BLA on an intracranial self-stimulation (ICSS) task 
(Extended Data Fig. 4). Consistent with previous reports that photo- 
stimulation of BLA axons in the NAc produced ICSS”’, we observed 
ICSS upon photostimulation of BLA cell bodies projecting to NAc 
(Fig. 2b, c). Given that we could not elicit robust nose-poke responses 
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for CeM projector photostimulation, we next tested CeM projectors in 
a closed-loop real-time place avoidance assay (RTPA), where an animal 
freely explored two chambers, one in which the mouse received photo- 
stimulation of CeM projectors. Photostimulation of CeM projectors 
caused robust avoidance of the light-paired side (Fig. 2d, e). 
Consistent with our data and previous studies, we demonstrated a 
causal relationship between NAc projectors and positive reinforcement 
and CeM projectors and negative reinforcement (avoidance). 

We went on to probe the necessity of NAc or CeM projectors in 
mediating reward or fear conditioning. The acquisition of fear’ and 
reward’ associations is mediated by an NMDAR-dependent LTP 
mechanism thought to require simultaneous glutamate release and 
postsynaptic depolarization. Thus, we tested whether projection-spe- 
cific hyperpolarization during the presentation of the unconditioned 
stimulus could impair learning in a valence-specific manner. 

To this end, we bilaterally infused an adeno-associated viral vector 
carrying halorhodopsin fused to an enhanced yellow fluorescent pro- 
tein, or a no-opsin control (eYFP only), expressed in a Cre-dependent 
manner (double-floxed inverted open reading frame) into the BLA 
(Extended Data Fig. 5). We then bilaterally infused a retrogradely 
travelling canine adenovirus carrying Cre recombinase (CAV-Cre) 
into either the NAc or CeM (Fig. 3a). We illuminated the BLA with 
yellow light only during conditioned—unconditioned stimulus pairing, 
that is, during shock or sucrose consumption. Photoinhibition of CCM 
projectors during the conditioned-unconditioned stimulus pairing 
impaired conditioned freezing (Fig. 3b and Extended Data Fig. 6) 
and enhanced conditioned reward seeking (Fig. 3c). 

Next, because there was no topographical separation between NAc 
and CeM projectors (Fig. 1b), which are both glutamatergic’’”’””"8, we 
searched for distinguishing characteristics of these functionally distinct 
neuronal populations. As the BLA is known to have some heterogeneity 
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Figure 3 | Photoinhibition of CeM projectors impairs fear learning and 
enhances reward learning. a, Halorhodopsin (NpHR) was expressed 
bilaterally either in NAc- or CeM-projecting BLA neurons using a dual-virus 
recombination strategy. Mice underwent fear or reward conditioning and 
yellow light was delivered to the BLA during each unconditioned stimulus. 

b, Time course of percentage freezing and average freezing in trials 6-8 (inset). 
Average freezing was related to experimental condition (one-way ANOVA, 
F49 = 6.68, P = 0.0033) and was significantly reduced by photoinhibition of 
CeM projectors, relative to controls (t33 = 3.46, **P < 0.01; see inset). c, Time 
course of normalized number of port entries relative to cue presentation during 
reward conditioning and average number of normalized port entries (<8 s 
latency, inset). z-score of port entries was related to the experimental condition 
(one-way ANOVA, F) 3; = 9.23, P = 0.0008) and was significantly increased 
by photoinhibition of CeM projectors, relative to controls (t25 = 4.11, 

*** DP < 0,001). Results show mean and s.e.m. 
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in electrophysiological and morphological characteristics''”, we com- 


pared these features between NAc and CeM projectors. While we did 
not observe differences in action potential half-width (Fig. 4a, b), 
threshold to spike (Fig. 4c, d), or intrinsic excitability (Fig. 4e, f), we 
did observe a significant difference in action potential accommodation 
(Fig. 4g and Extended Data Fig. 7). To investigate the morphological 
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Figure 4 | Electrophysiological, morphological and transcriptional profiles 
of NAc and CeM projectors. a, Population average of action potential traces. 
CeM pr., CeM projectors; NAc pr., NAc projectors. b, No detectable difference 
in action potential half-width (unpaired t-test, t) = 1.82, P = 0.085). Open 
circles represent individual data points. c, Representative trace from action 
potential threshold detection protocol. d, No detectable difference in action 
potential threshold between NAc and CeM projectors (unpaired t-test, toy = 
1.05, P = 0.31). e-g, Representative trace (e) from current injection protocol to 
determine firing rate responses (f) and action potential probability (g) over 
time, which was different between NAc and CeM projectors (interaction, two- 
way ANOVA Fo,180 = 2.32, P = 0.017) in the first 100 ms of current injection 
(to99 = 4.55, ***P < 0.001). h, Representative reconstructions showing 
dendritic branching pattern in the coronal plane. Cells were classified into 
pyramidal or stellate based on the presence of an apical tuft (right). i, Sholl 
analysis of neuron reconstructions. j, Schematic of transcriptome profiling. 

k, Candidate genes identified as differentially expressed between NAc and CeM 
projectors at a 0.01 quantile fold-change threshold, corresponding to a false 
discovery rate (FDR) of 26.2% (see also Extended Data Fig. 9) across two 
independent repetitions of RNA-seq (n=8 mice for NAc pr., n=9 mice for CeM 
pr. total). Results show mean and s.e.m. 
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features of these functionally distinct populations, we reconstructed 
projection-identified BLA neurons. We observed greater distal 
dendritic branching in CeM projectors (Fig. 4h, i), though NAc pro- 
jectors and CeM projectors contained both pyramidal and stellate cells 
(Fig. 4i inset and Extended Data Fig. 8). 

Finally, we compared the transcriptomes of BLA neurons projecting 
to the NAc or CeM (Fig. 4j and Extended Data Fig. 9). Following 
retrobead injections into the NAc or CeM, we dissociated labelled 
BLA neurons and performed RNA-seq (Fig. 4j). RNA-seq revealed 
relatively few candidate genes expressed differentially between NAc 
and CeM projectors, consistent with the idea that these two popula- 
tions are closely related (Fig. 4k and Extended Data Fig. 9). However, 
these differentially expressed candidate genes may underpin the 
mechanisms that contribute to the wiring of these distinct populations 
through development and/or rapidly biasing gain modulation of syn- 
aptic transmission during valence-specific learning. 

Taken together, NAc and CeM projectors are populations of BLA 
neurons that undergo opposing synaptic changes following fear or 
reward conditioning, and optogenetic manipulation of NAc and CeM 
projectors reveals causal relationships with valence-specific behaviours. 
Further, we have identified distinguishing electrophysiological, morpho- 
logical and gene expression characteristics that facilitate further invest- 
igation. Our study suggests that the indelible nature of valence encoding 
observed in amygdala neurons’’ is mediated by connectivity, and the 
topographical intermingling of these populations may serve to facilitate 
interaction®’. In conclusion, the BLA is a site of divergence for circuits 
mediating positive and negative emotional or motivational valence. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 13 April 2014; accepted 3 March 2015. 


1. Tye, K.M., Stuber, G. D., De Ridder, B., Bonci, A. & Janak, P. H. Rapid strengthening 
of thalamo-amygdala synapses mediates cue-reward learning. Nature 453, 
1253-1257 (2008). 

2. Tye, K. M. et al. Methylphenidate facilitates learning-induced amygdala plasticity. 
Nature Neurosci. 13, 475-481 (2010). 

3. McKernan,M.G.& Shinnick-Gallagher, P. Fear conditioning induces a lasting poten- 
tiation of synaptic currents in vitro. Nature 390, 607-611 (1997). 

4. Rogan, M. T., Staubli, U. V. & LeDoux, J. E. Fear conditioning induces associative 
long-term potentiation in the amygdala. Nature 390, 604-607 (1997). 

5. Rumpel, S., LeDoux, J., Zador, A. & Malinow, R. Postsynaptic Receptor Trafficking 
Underlying a Form of Associative Learning. Science 308, 83-88 (2005). 

6. Maren, S. Synaptic mechanisms of associative memory in the amygdala. Neuron 
47, 783-786 (2005). 

7. Han, J.-H. et a/. Selective erasure of a fear memory. Science 323, 
(2009). 

8. Paton, J. J., Belova, M. A., Morrison, S. E. & Salzman, C. D. The primate amygdala 
represents the positive and negative value of visual stimuli during learning. Nature 
439, 865-870 (2006 

9. Shabel, S. J. & Janak, P. H. Substantial similarity in amygdala neuronal activity 
during conditioned appetitive and aversive emotional arousal. Proc. Natl Acad. Sci. 
USA 106, 15031-15036 (2009). 

10. Redondo, R. L. et al. Bidirectional switch of the valence associated with a 
hippocampal contextual memory engram. Nature 513, 426-430 (2014). 

11. Sah, P., Faber, E.S.L., Lopez De Armentia, M. & Power, J. The amygdaloid complex: 
anatomy and physiology. Physiol. Rev. 83, 803-834 (2003). 

12. Romanski, L. M., Clugnet, M.-C., Bordi, F. & LeDoux, J. E. Somatosensory and 
auditory convergence in the lateral nucleus of the amygdala. Behav. Neurosci. 107, 
444-450 (1993). 

13. Fontanini, A., Grossman, S. E., Figueroa, J. A. & Katz, D. B. Distinct subtypes of 
basolateral amygdala taste neurons reflect palatability and reward. J. Neurosci. 29, 
2486-2495 (2009). 


492-1496 


— 


678 | NATURE | VOL 520 | 30 APRIL 2015 


14. Pitkanen, A. in The Amygdala: A Functional Analysis 2nd edn (ed. Aggleton, J. P.) Ch. 
2 (Oxford Univ. Press, 2000). 

15. Davis, M. The role of the amygdala in fear and anxiety. Annu. Rev. Neurosci. 15, 
353-375 (1992). 

16. Cardinal, R. N., Parkinson, J. A., Hall, J. & Everitt, B. J. Emotion and motivation: the 
role of the amygdala, ventral striatum, and prefrontal cortex. Neurosci. Biobehav. 
Rev. 26, 321-352 (2002). 

17. Uwano, T., Nishijo, H., Ono, T. & Tamura, R. Neuronal responsiveness to various 
sensory stimuli, and associative learning in the rat amygdala. Neuroscience 68, 
339-361 (1995). 

18. Clem, R. L.& Huganir, R. L. Calcium-permeable AMPA receptor dynamics mediate 
fear memory erasure. Science 330, 1108-1112 (2010). 

19. Miserendino, M.J.D., Sananes, C. B., Melia, K.R. & Davis, M. Blocking of acquisition 
but not expression of conditioned fear-potentiated startle by NMDA antagonists in 
the amygdala. Nature 345, 716-718 (1990). 

20. Fanselow, M. S. & LeDoux, J. E. and others. Why we think plasticity underlying 
Pavlovian fear conditioning occurs in the basolateral amygdala. Neuron 23, 
229-232 (1999). 

21. Stuber, G. D. et al. Excitatory transmission from the amygdala to nucleus 
accumbens facilitates reward seeking. Nature 475, 377-380 (2011). 

22. Ambroggi, F., Ishikawa, A., Fields, H. L. & Nicola, S. M. Basolateral amygdala 
neurons facilitate reward-seeking behavior by exciting nucleus accumbens 
neurons. Neuron 59, 648-661 (2008). 

23. Jimenez, S.A. & Maren, S. Nuclear disconnection within the amygdala reveals a 
direct pathway to fear. Learn. Mem. 16, 766-768 (2009). 

24. Haubensak, W. et al. Genetic dissection of an amygdala microcircuit that gates 
conditioned fear. Nature 468, 270-276 (2010). 

25. Ciocchi, S. et al. Encoding of conditioned fear in central amygdala inhibitory 
circuits. Nature 468, 277-282 (2010). 

26. Wickersham, |.R. etal. Monosynaptic restriction of transsynaptic tracing from single, 
genetically targeted neurons. Neuron 53, 639-647 (2007). 

27. Paré, D., Smith, Y. & Paré, J.-F. Intra-amygdaloid projections of the basolateral and 
basomedial nuclei in the cat: Phaseolus vulgaris-leucoagglutinin anterograde 
tracing at the light and electron microscopic level. Neuroscience 69, 567-583 
(1995). 

28. Tye, K.M. etal. Amygdala circuitry mediating reversible and bidirectional control of 
anxiety. Nature 471, 358-362 (2011). 

29. Washburn, M.S. & Moises, H.C. Electrophysiological and morphological properties 
of rat basolateral amygdaloid neurons in vitro. J. Neurosci. 12, 4066-4079 (1992). 

30. Janak, P. H. & Tye, K. M. From circuits to behaviour in the amygdala. Nature 517, 
284-292 (2015). 


Acknowledgements We thank C. Wildes, G. Glober, R. Luck and G. F. Conyers for 
technical assistance. We thank the entire Tye laboratory, B. Okaty, U. Eser, J. di lulio and 
A. Pfenning for helpful discussion. We thank E. J. Kremer for providing the CAV2-Cre virus, 
R. Neve for the HSV virus carrying Cre recombinase and mCherry, and the UNC vector 
core for the AAV viruses carrying double floxed constructs. K.M.T. is a New York Stem Cell 
Foundation-Robertson Investigator and holds a Whitehead Career Development Chair. 
This work was supported by funds from NIMH (RO1-MH102441-01), NIDDK 
(DP2-DK-102256-01), the JPB Foundation (PIIF and PNDRF), NARSAD, Klingenstein, 
Whitehall and Sloan Foundations (K.M.T.). Additionally, S.Y. and J.G. were supported by 
NIMH (RO1-MH101528-01), P.N. was supported by Singleton, Leventhal and Whitaker 
fellowships, and A.B. was supported by a fellowship from the Swiss National Science 
Foundation. R.W. was supported by post-doctoral fellowships from the Simons Center for 
the Social Brain and the Netherlands Organization for Scientific Research (NWO) 
RUBICON fellowship program. |.R.W. was supported by seed grants from the McGovern 
Institute for Brain Research, the Picower Institute for Learning and Memory, the MIT 
Department of Brain and Cognitive Sciences, and the Simons Center for the Social Brain, 
as well as by BRAIN Initiative awards from NIMH, NEI, and NINDS (U01-MH106018 and 
U01-NS090473) and NSF (lOS-1451202). 


Author Contributions K.M.T. supervised all experiments. P.N., A.B. and K.M.T. designed 
the experiments. P.N., A.B., G.G.C., S.Y., R.W., S.S.H., M.A,, S.A.H., and K.L.M. collected data. 
P.N., S.S.H., A.C.F.-O. and K.L.M. reconstructed neurons. P.N. wrote all MATLAB scripts. 
S.Y. and J.G. contributed to RNA-seq experiments and analyses. P.N., A.B. and G.G.C. 
analysed all other data sets. |.R.W. designed and provided rabies viral vectors. K.M.T., P.N., 
A.B and G.G.C. wrote the manuscript and all authors read and edited the manuscript. 


Author Information RNA-seq data has been deposited in the NCBI Gene Expression 
Omnibus (GEO) under accession code GSE66345. Reprints and permissions 
information is available at www.nature.com/reprints. The authors declare no 
competing financial interests. Readers are welcome to comment on the online version 
of the paper. Correspondence and requests for materials should be addressed to K.M.T. 


(kaytye@mit.edu) 


©2015 Macmillan Publishers Limited. All rights reserved 


METHODS 

Animals and stereotaxic surgery. Adult wild-type male C57BL/6 mice (248 
mice), aged 6-12 weeks (8.3 + 1.5 weeks; Jackson Laboratory or Charles River 
Laboratories for RNA-seq) were used for experiments. Following surgery, animals 
were maintained on a reverse 12 h light/dark cycle with ad libitum food and water. 
All procedures of handling animals were in accordance with the guidelines 
from the NIH, and with approval of the MIT or Harvard Medical School 
Institutional Animal Care and Use Committee. All surgeries were conducted 
under aseptic conditions using a digital small animal stereotaxic instrument 
(David Kopf Instruments). 

Mice were anaesthetized with isoflurane (5% for induction, 1.5-2.0% afterward) 
in the stereotaxic frame for the entire surgery and their body temperature was 
maintained with a heating pad. In order to label basolateral amygdala (BLA) 
neurons projecting to the nucleus accumbens (NAc), about 70 nl of red or green 
retrobeads (RetroBeads, Lumafluor Inc.) were injected into the NAc at stereotaxic 
coordinates from bregma: + 1.4 mm anteroposterior, +0.87 mm mediolateral and 
—4.7 mm dorsoventral. In order to label BLA neurons projecting to the medial 
part of the central amygdala (CeM), 50 nl of retrobeads (different colour from NAc 
injection) were injected in the contralateral CeM (—0.75 mm anteroposterior, 
+2.35 mm mediolateral and —5.08 mm dorsoventral). To test the causal role of 
BLA neurons projecting to NAc or to CeM in reward and aversive behaviours, we 
injected a retrogradely travelling rabies virus carrying channelrhodopsin2-Venus 
fusion protein (RV-4ChopV(B19G)) or Venus’? (Fig. 2 and Extended Data Fig. 
4) into NAc (250 nl) or CeM (180 nl). These virus constructs are referred to as RV- 
ChR2-Venus and RV-Venus in the manuscript. All animals receiving control 
virus injection were pooled into one control group. Injections were performed 
using glass micropipettes (1-5 yl; Drummond Scientific) pulled with a puller 
(Narishige PC-10) mounted on 10-11 microsyringes (Hamilton Microlitre 701; 
Hamilton Co.) to deliver the retrobeads at a rate of 2 nls‘ or virus at a rate of 0.5 
to 1 nls ', using a microsyringe pump (UMP3; WPI) and controller (Micro4; 
WPI). After completion of the injection, the pipette was raised 100 jm and left for 
an additional 10 min to allow diffusion of the retrobeads or the virus at the 
injection site and then slowly withdrawn. 

In a separate group of experiments (Fig. 3), adeno-associated virus serotype 5 
(AAVs) carrying halorhodopsin 3.0 fused to enhanced yellow fluorescent protein 
(eYFP) in a double-floxed inverted open reading frame (DIO) under the control of 
EF1la promoter (AAVs5-EFla-DIO-eNpHR3.0-eYFP) or a corresponding fluoro- 
phore control (AAV;-EF1a-DIO-eYFP) was injected bilaterally into the BLA (400 
nl in each hemisphere) at stereotaxic coordinates from bregma: — 1.60 mm ante- 
roposterior, +3.35 mm mediolateral and —4.90 mm dorsoventral. Concurrently, 
canine adenovirus 2 (CAV2)**** carrying Cre-recombinase (or a mixture of 
CAV2-Cre and herpes simplex virus carrying Cre-recombinase and the fluor- 
escent reporter mCherry under the control of EFla promoter) was injected into 
the NAc (300 nl in each hemisphere) or CeM (150 nl in each hemisphere) at the 
same coordinates described above. We summarize this strategy with the designa- 
tion ‘Cre-DIO’. All animals receiving fluorophore control injection were pooled 
into one control group. In this manuscript, CAV refers to CAV2, and NpHR refers 
to eNpHR3.0. 

In order to deliver light to the BLA, a 300-j1m diameter optic fibre (0.37 numer- 
ical aperture (NA)) glued to a 1.25 mm ferrule was implanted above the BLA 
(—1.64 mm anteroposterior, +3.35 mm mediolateral and —4.25 mm dorsoventral). 
One layer of adhesive cement (C&B metabond; Parkell) followed by cranioplastic 
cement (Dental cement; Stoelting) was used to secure the fibre ferrule to the skull 
and, 20 min later, the incision was sutured. Blue light (~20 mW, ~283 mW per 
mm‘ at the fibre tip) was delivered using a 473 nm laser and yellow light (~10 mW, 
~141 mW per mm’ at the fibre tip) was delivered using a 589/593 nm laser. 

After surgery, body temperature was maintained using a heat lamp until the 
animal fully recovered from anaesthesia. Behavioural experiments followed by ex 
vivo electrophysiological recordings were conducted approximately 2 weeks after 
surgery. Behavioural experiments requiring expression via rabies virus were con- 
ducted 4-5 days after surgery (Fig. 2). Behavioural experiments requiring viral 
expression via the Cre-DIO strategy were conducted 3 months after surgery 
(Fig. 3). 

Pavlovian conditioning. Fear conditioning. The mice were conditioned using 
behavioural hardware boxes (MedAssociate) placed in custom-made sound- 
attenuating chambers. Each box contained a modular test cage with an electrifiable 
grid floor and a speaker. On the day of conditioning, all animals were exposed to 
two 30 min conditioning sessions separated by 20 min in the home cage (Extended 
Data Fig. le). During the first session, mice in the unpaired group received six tone 
presentations, while mice in the paired group received no cues. During the second 
session, mice in the unpaired group received six foot shocks whereas mice in the 
paired group received six tones, each co-terminating with a shock (paired fear). 
During each session, a period of acclimation lasting 200 s preceded the presentation 
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of the first tone or foot shock. The conditioned stimulus consisted of a 2 kHz, 80 dB 
pure tone lasting 20 s. The unconditioned stimulus consisted of scrambled 1.5 mA 
foot shock lasting 2 s. During paired conditioning, the conditioned and uncon- 
ditioned stimuli were co-terminating. Cue presentations were separated by 70 to 
130 s. Following conditioning, mice were returned to their home cages for ~24 h 
until preparation of brain slices (Extended Data Fig. le). Videos of the mice were 
acquired during the second session for the paired group and both sessions for the 
unpaired group, to allow freezing quantification during the tone (an infrared LED 
was switched on during the period of the tone to synchronize CS with the video). 
Measurement of fear behaviour. Percentage time freezing during conditioned 
stimulus presentation (Extended Data Fig. 1f) was quantified by manual scoring 
with the help of custom-written software in MATLAB. For each of the six trials, a 
segment of the video containing 20 s of the tone and the 20 s preceding the tone 
was extracted frame by frame and exported into MATLAB. Two additional seg- 
ments of the same duration were extracted starting at randomly chosen points in 
the video. The sequence of these eight trials was then randomized and presented to 
a human scorer who was blind to the trial number. The scores were then re- 
assembled and the percentage of freezing during the conditioned stimulus was 
calculated. Frame numbers of the video containing the tone were confirmed by 
generating a heat map of the intensity of the pixels containing the infrared LED. 
Reward conditioning. Twenty hours after food restriction, the mice were condi- 
tioned in sound-attenuating boxes (MedAssociates). Each box contained a modu- 
lar test cage assembled with a sucrose delivery port, a speaker and a house light 
placed under the sucrose port. The conditioned stimulus consisted of a compound 
light-tone cue, ended by a beam break, 400 ms after port entry detection. If the 
mouse did not enter the port, the tone lasted for 30 s. The tone was a 5 kHz, 80 dB 
pure tone. For the paired group, a small volume (15 ll) of a sucrose solution (30%) 
was delivered into the port 1 s after conditioned stimulus onset, only if the mouse 
had entered the port after the onset of the previous conditioned stimulus, to 
prevent sucrose accumulation. For the unpaired group, no sucrose was delivered 
to the port. The inter-trial interval (ITI) of the conditioned stimulus presentations 
was chosen randomly from a list at runtime and was 143 + 40 s for the first 20 
conditioned stimulus presentations and 108 + 32 s for the 100 subsequent con- 
ditioned stimulus presentations. The conditioning session was terminated after 
120 sucrose deliveries and lasted for about 4 h. Performance of the mouse was 
assessed during the second half of a conditioning session. After the training ses- 
sion, the unpaired mice received the same amount of sucrose as the paired mouse 
(in their home cage 15 pl X 120 = 1.8 ml, Extended Data Fig. 1e). In the paired 
group, if the mice did not claim all the sucrose, the remaining volume was made 
available to the mouse in its home cage after the conditioning session. After 20 min 
in their home cages, all the animals had ad libitum food until the electrophysiology 
experiment the following day. 

Learning criterion for reward learning. A mouse was considered to have acquired 
the task (and classified as a learner) if the number of port entries from +1sto +8s 
relative to conditioned stimulus onset (Extended Data Fig. 1g, black line) was 
significantly higher than the number of port entries from —1 s to —8 s relative 
to conditioned stimulus onset (Extended Data Fig. 1g, grey line). Statistical sig- 
nificance was tested using a one-sided Wilcoxon rank-sum test (MATLAB) and 
the threshold for learning was set at P < 0.001. 

Conditioning for photoinhibition experiments. In these experiments, the same 
animals experienced reward conditioning followed by fear conditioning after 
1-4 weeks. Pure tones of 2 and 10 kHz were used as conditioning stimuli in these 
two paradigms; the use of these two tones was counterbalanced across animals. 
Animals were tethered using a dual commutator for light delivery. During fear 
conditioning, yellow light was delivered 1 s before shock onset until 1 s after shock 
termination (shock intensity = 0.5 mA). During reward learning, yellow light was 
delivered for 7.5 s immediately following a port entry during conditioned stimulus 
(tone) presentation. 

Ex vivo electrophysiology. Brain tissue preparation. About 2 weeks after retro- 
bead injections in NAc and CeM, 114 mice were anaesthetized with 90 mg kg 
pentobarbital sodium and perfused transcardially with 10 ml of modified artificial 
cerebrospinal fluid (ACSF) at ~4 °C saturated with 95% O, and 5% COz, contain- 
ing: 75 mM sucrose, 87 mM NaCl, 2.5 mM KCl, 1.3 mM NaH2PO,, 7 mM MgCh, 
0.5 mM CaCl,, 25 mM NaHCO; and 5 mM ascorbic acid (pH 7.25-7.4, 327 + 3 
mOsm). The brain was then extracted and glued (Roti coll 1; Carh Roth GmbH) on 
the platform of a semiautomatic vibrating blade microtome (VT 1200; Leica). The 
platform was then placed in the slicing chamber containing modified ACSF at 4 
°C. Coronal sections of 300 tm containing the NAc, CeM or BLA were collected in 
a holding chamber filled with ACSF saturated with 95% O2 and 5% CO, contain- 
ing: 126 mM NaCl, 2.5 mM KCl, 1.25 mM NaH2PO,, 1.0 mM MgCl, 2.4 mM 
CaCl;, 26.0 mM NaHCO3, 10 mM glucose (pH 7.25-7.4, 298 + 2 mOsm). 
Recordings were started 1 h after slicing and the temperature was maintained at 
approximately 31 °C both in the holding chamber and during the recordings. 
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All retrobead injection sites were checked and imaged with a camera (Hamatsu) 

attached to the microscope (BX51; Olympus, Extended Data Fig. 1). The slice 
images were reported on the mouse brain atlas (Paxinos and Watson) and the 
centre of the injection was taken at the brightest point of the fluorescence. Some of 
the retrobead injections had dorsoventral leaks. In this case, the centre of the 
injection was taken on the brightest fluorescent point in the target structure 
(Extended Data Fig. 1). If the injection site was outside NAc or CeM, respective 
projectors of this injection were not recorded in the BLA. In addition, all the 
CeM injections were overlaid on the mouse atlas. CeM projector recordings col- 
lected from animals with injection sites that had leaks into CeC and/or CeL 
were discarded. 
Whole-cell patch-clamp recording. Recordings were made from visually identified 
neurons containing retrobeads. Patched cells were filled with Alexa Fluor 350 and 
biocytin, visualized and superimposed with retrobead fluorescence to confirm 
whether the patched cell was retrobead positive. 

For measuring AMPAR/NMDAR ratio, brain slices containing the BLA were 
then placed in the recording chamber perfused with ACSF containing 100 1M of 
the y-aminobutyric A receptor (GABA,R) antagonist picrotoxin (R&D systems). 
Picrotoxin was not used to assay passive membrane properties from NAc- and 
CeM-projecting BLA neurons. A bipolar stimulating electrode (~80 jum spacing 
between tips) was placed in the amygdala-striatal transition zone containing 
internal capsule fibres (Extended Data Fig. 2). Electric stimulation intensity was 
between 0.01 and 0.2 mA. For electrophysiological characterization of NAc and 
CeM projectors (Fig. 3a—g) and confirmation of the health of rabies-virus-trans- 
duced cells (Extended Data Fig. 7), picrotoxin was not added to the ACSF. 

Voltage-clamp recordings were made using glass microelectrodes (4-6 MQ) 
shaped with a horizontal puller (P-1000) and filled with a solution containing: 120 
mM caesium methansulphonate, 20 mM HEPES, 0.4 mM EGTA, 2.8 mM NaCl, 5 
mM tetraethylammonium chloride, 2.5 mM MgATP, 0.25 mM NaGTP, 8 mM 
biocytin and 2 mM Alexa Fluor 350 (pH 7.3, 283 mOsm). The cells were first 
clamped at —70mV to determine optimal intensity for the electric stimulation of 
the internal capsule. Current-clamp recordings to characterize electrophysio- 
logical properties of NAc and CeM projectors were made using similar glass 
microelectrodes (4-6 MQ) filled with a solution containing: 125 mM potassium 
gluconate, 20 mM HEPES, 10 mM NaCl, 3 mM MgATP, 8 mM biocytin and 2 mM 
Alexa Fluor 350 (pH 7.3; 283 mOsm). 

Recorded signals were amplified using Multiclamp 700B amplifier (Molecular 
Devices). Analogue signals were digitized at 10 kHz using a Digidata 1440 and 
pClamp9 software (Molecular Devices). ACSF and drugs were applied to the slice 
via a peristaltic pump (Minipuls3; Gilson) at 3 ml per min. 

All recordings were performed blind to the performance of the animal, and a 
subset of the data was obtained blind to the behavioural conditioning group of the 
animal. We recorded the position of the cells within the BLA (Extended Data Fig. 
2) and the placement of the stimulating electrode relative to the BLA. There was no 
observable difference in either of these parameters across slices obtained from 
animals in different behavioural conditioning groups. 

In order to obtain the AMPAR/NMDAR ratio, the cell was first voltage clamped 
at +40 mV. Once we obtained a stable baseline excitatory post-synaptic current 
(EPSC) amplitude in response to internal capsule fibre stimulation (compound 
AMPAR + NMDAR current), we bath applied the NMDAR antagonist AP5 (D- 
(-)-2amino-5-phosphonopentanoate; R&D systems) at a concentration of 50 uM. 
AMPAR EPSCs were recorded starting from 5 min after the action of AP5. 
NMDAR current was obtained by subtracting the average EPSC trace of the 
AMPAR current from the compound current. Each group in Fig. 1 included 
9-13 neurons recorded from 6-9 mice. 

Histology. The location of all recorded neurons was checked after the recording. 
Co-localization of Alexa Fluor 350 and retrobeads was confirmed at the end of the 
recording, and double-checked with confocal microscopy for the cells that were 
recovered with streptavidin staining. For each experiment, the slices containing a 
retrobead injection site (NAc and CeM) or a recorded neuron (BLA) were fixed 
overnight at 4 °C in 4% paraformaldehyde (PFA), and then kept in phosphate 
buffered saline (PBS). Slices containing patched neurons were incubated for 2 h in 
streptavidin-CF405 (2 mg ml‘, dilution 1:500; Biotium), mounted on microscope 
slides with PVA-DABCO and imaged under the confocal microscope (Extended 
Data Fig. 3). 

Data analysis. Offline analysis of AMPAR/NMDAR ratios, paired-pulse ratios and 
electric properties was performed using Clampfit (Molecular Devices) and 
MATLAB software written by P.N. Membrane properties including access resist- 
ance of the cell were computed by a MATLAB implementation of the Q-method”’. 
All custom-written software is available upon request. 

In vivo optogenetic behaviour. Light delivery. For optical stimulation during 
behavioural assays (Fig. 2), a 473 nm (blue light) or 589/593 nm laser (yellow 
light) (OEM Laser Systems) was connected to a patch cord with a pair of FC/PC 


connectors in each end (Doric). This patch cord was connected through a fibre 
optic rotary joint (which allows free rotation of the fibre; Doric) with another patch 
cord with an FC/PC connector on one side and a ferrule connection on the other 
side (matching the size of the ferrule glued to the optic fibre implanted in the 
mouse). The optic fibre implanted in the mouse (300 jum diameter, 0.37 NA) was 
connected to the optic patch cord using ceramic mating sleeves (PFP). Blue light 
was delivered at 20 mW in 20 Hz, 5 ms light pulses. Yellow light was delivered at 10 
mW, for 4 s during fear conditioning and 7.5 s during reward conditioning. Laser 
output was modulated with a Master 8 pulse stimulator (A.M.P.L). Onset of laser 
pulses was determined by behavioural hardware (MedPC Associates). 
Intracranial optical self-stimulation (ICSS). 4.5 days after surgery, mice were food- 
restricted overnight. Immediately before the start of each session, fibre optic 
implants were connected to a patch cord and the mouse was placed in condition- 
ing chambers equipped with active and inactive nose-poke ports directly below 
two cue lights, as well as auditory stimulus generators and video cameras. Mice 
were given two self-stimulation sessions on two consecutive days (5.5 and 6.5 days 
after surgery) in which they could respond freely at either nose-poke port. On day 
1 both nose-poke ports were baited with a crushed cereal treat to facilitate initial 
investigation. The start of each 2 h session was indicated by the illumination of 
both nose-poke ports and the onset of low-volume white noise to mask unrelated 
sounds. Each nose poke in the active port resulted in light stimulation of BLA 
projectors (60 pulses, 20 Hz, 5 ms pulse duration). Concurrently, the cue-light 
above the respective port was illuminated and a distinct 1-s tone was played for 
each nose poke (1 kHz and 1.5 kHz counterbalanced), providing a visual and 
auditory cue whenever a nose poke occurred. Active and inactive nose-poke time 
stamps were recorded using Med-PC software and day 2 data were analysed using 
MATLAB and Microsoft Excel. 

Real-time place avoidance (RTPA). The RTPA chamber was constructed from 
transparent plastic (50 X 53 cm) and divided into two equal compartments. 
One of these was assigned as the photo-stimulated zone (counterbalanced between 
animals). At the start of the 1 h session, individual mice were placed in the non- 
stimulated side of the chamber. Every time the mouse crossed to the side of the 
chamber paired with photostimulation, 20 Hz (~20 mW, 5 ms pulse duration) 
laser stimulation was delivered until the mouse crossed back into the non-stimu- 
lated side. Ethovision XT video tracking software (Noldus Information 
Technologies) was used to track the animal and control the onset and offset of 
light pulse trains. Data were subsequently analysed using MATLAB and Microsoft 
Excel software. 

Histology. After optogenetic experiments, all mice were anaesthetized with pen- 
tobarbital sodium, and transcardially perfused with ice-cold Ringer’s solution 
followed by ice-cold 4% PFA in PBS (pH 7.3). Extracted brains were fixed in 4% 
PFA overnight and then equilibrated in 30% sucrose in PBS. 40-j1m thick coronal 
sections were sliced using a sliding microtome (HM430; Thermo Fisher Scientific) 
and stored in PBS at 4 °C until they were processed for histology. Sections were 
then incubated with a DNA-specific fluorescent probe (DAPI: 4’ ,6-diamidino-2- 
phenylindole (1:50,000)) for 30 min, and finally washed with 1X PBS followed by 
mounting on microscope slides with PVA-DABCO. 

Imaging. Confocal fluorescence images were acquired on an Olympus FV1000 
confocal laser scanning microscope using a 10X/0.40 NA objective for viral injec- 
tions and fibre placements imaging or a 40X/1.30 NA or 60X/1.42 NA oil-immer- 
sion objectives for imaging streptavidin-CF405 stained neurons. The centre of the 
viral injection was taken at the brightest fluorescent point in anteroposterior, 
mediolateral and dorsoventral axis. The tip of the fibre was determined by the 
~50-1m thick gliosis generated by the fibre. Neurons recovered from the strepta- 
vidin staining (~60% recorded in the whole cell) were imaged covering the whole 
dendritic and axonal arbourization contained in the slice. 

Neuron reconstructions. Imaris software (Bitplane Inc.) was used to reconstruct 
neurons from z-stacks of confocal images and to perform Sholl analysis**. Since we 
are reconstructing neurons filled during whole-cell patch clamp recordings, we 
chose to focus solely on dendritic branching patterns and did not examine para- 
meters such as the volume of the soma, since this measure might be compromised 
by experimental procedures. Overlaying the atlas on images of brain slices revealed 
that the distances depicted in the atlas are about 90% smaller than physical dis- 
tances in brain slices. For example, if the length of a brain region is 900 jm in the 
atlas, then the real distance measured in our brain sections was about 1000 Lm. 
Therefore, to depict the reconstructed neurons (Fig. 4h, i and Extended Data 
Fig. 8), we multiplied physical distances by a factor of 0.9. 

RNA sequencing (RNA-seq). Manual cell sorting and RNA sequencing. The 
RNA-seq experiment was repeated twice to verify reproducibility, and we refer 
to these biological replicates as experiment 1 and experiment 2 (see Extended 
Data Fig. 9c: samples used in experiment 1 and 2 are indicated in black (n = 9) 
and blue (n = 8) respectively below the heatmap). Manual sorting of fluores- 
cent cells were carried out as described in ref. 37. In brief, adult C57Bl6 male 
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mice (~6-8-weeks-old at surgery, CS57BL/6NCrl for experiment 1, C57BL/6] for 
experiment 2) were injected with red fluorescent retrograde beads into either NAc 
or CeM. Surgeries for experiment 1 were conducted at Harvard Medical School 
by P.N. and surgeries for experiment 2 were conducted at MIT by G.G.C. About 
2 weeks after surgery, animals were decapitated under isoflurane anaesthesia and 
their brain was quickly removed and transferred into ice-cold oxygenated ACSF, 
containing 126 mM NaCl, 20 mM NaHCO;, 20 mM dextrose, 3 mM KCl, 1.25 
mM NaH>PO,, 2 mM CaCl, 2 mM MgCh, 50 uM AP5, 20 1M DNQX and 100 
nM TTX. Acute 330-um coronal brain slices were prepared and incubated in a 
protease (1.2 mg ml ' protease E; Sigma-Aldrich) containing oxygenated ACSF 
for 50 min. After 15 min of washes in the ACSF, BLA tissue was microdissected 
using a pair of fine scissors under a fluorescent dissecting microscope (Leica 
M165FC stereomicroscope). The dissected BLA tissue was then triturated in 
ACSF using a series of three Pasteur pipettes of decreasing tip diameters and 
the dissociated cells were transferred into a small Petri dish. With visual control 
under a fluorescent dissecting microscope, red-retrobead-positive neurons were 
aspirated into a micropipette with a 30-50 jim tip diameter and transferred into a 
clean Petri dish. A total of 35-60 retrobead-positive neurons were pooled for each 
sample, which were immediately lysed in 50 kl of extraction buffer (PicoPure RNA 
isolation kit, Arcturus, Life Technologies) and total mRNA was subsequently 
isolated. Complementary DNA was synthesized using Ovation RNA-seq System 
V2 kit (Neugen). We obtained approximately 6 1g of CDNA from 35-60 cells from 
each group. Then, the cDNA library was prepared using Ovation Ultralow 
DRMultiplex Systems (Nugen). Sequencing was conducted on an Illumina 
HiSeq2500 using single-end 50 base pairs at the Biopolymer facility, Harvard 
Medical School (for experiment 1), and an illumina NextSeq 500 using single- 
end 75 base pairs with high flow cell size at the FAS centre for systems biology, 
Harvard University (for experiment 2). The total number of reads that we obtained 
for each sample were approximately 34 million (for experiment 1) and 50 million 
(for experiment 2). 

Analysis of RNA-seq data. Sequencing reads were mapped using Tophat version 
2.0.10 (http://ccb.jhu.edu/software/tophat/index.shtml) against the Mus musculus 
UCSC version mm10 genome. After alignment, the read counts for each gene were 
extracted using HTseq (http://www-huber.embl.de/users/anders/HTSeq/doc/ 
overview.html) based on the mm10 Refseq gff file. Log2 fold differences were 
computed from each of two independent experiments using DEseq2. Candidate 
differentially expressed genes were required to be enriched in CeM or NAc pro- 
jectors at a quantile fold-difference threshold of 0.01 (Fig. 4) or 0.02 (Extended 
Data Fig. 9) in each of two independent experiments (n = 8, NAc n = 9, CeM 
total). To estimate false discovery rate (Extended Data Fig. 9), we used two types of 
chance estimates. One of the chance estimates, ‘flip-flopped’, is taken from genes 
that passed the quantile thresholds but were enriched in opposite populations in 
the two experiments. Another chance estimate, ‘permuted’, is determined based on 
permuting fold differences across genes within each independent experiment 
(Extended Data Fig. 9). We estimated false discovery rate using the more conser- 
vative chance estimate, flip-flopped with the following formula: (number of genes 
on the flip-flopped list/number of genes on the differentially expressed genes). 
RNA-seq data has been deposited in the Gene Expression Omnibus under acces- 
sion code GSE66345 

Statistical analysis. Statistical analyses were performed using commercial soft- 
ware (GraphPad Prism; GraphPad Software, Inc.). Within-subject comparisons 
were made using paired tests. Group differences were detected using either one- 
way analysis of variance (ANOVA) or with two-way ANOVA, both followed by 
Bonferroni post-hoc tests. Corrections for multiple comparisons were made when 
appropriate. The reported numbers of degrees of freedom (df) for each one-way 
ANOVA are between column degrees of freedom and total degrees of freedom. 
Since normality tests have little power to detect non-gaussian distributions with 
small data sets, we did not explicitly test for the normality of our data sets. We used 
the Grubbs’ test to detect and remove outliers from our data. Single-variable 
differences were detected with two-tailed paired or unpaired (as noted) 
Student’s t-tests. For all results, the significance threshold was placed at « = 
0.05 (*P < 0.05, **P < 0.01, ***P < 0.001), and corrections for multiple com- 
parisons were reflected in the P value rather than in «. All data are shown as mean 
and s.e.m. To assess learning during the reward task, we used a one-sided 
Wilcoxon rank-sum test (MATLAB) and set the threshold for learning at P < 
0.001. Result sheets of statistical tests from GraphPad software detailing (wherever 
applicable) estimates of variance within each group, confidence intervals, effec- 
tiveness of pairing (in case of paired t-tests), comparison of variances across 
groups, etc. are available upon request. 

Sample size. The target number of samples in each group was determined based 
on numbers reported in published studies. No statistical methods were used to 
predetermine sample size. In the photoinhibition experiment, since the viral 
incubation time was long, we factored in the skill of the surgeon to determine 
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the number of surgeries to be performed. Our target number of animals in each 
group was 12. The experimenter performing surgeries was known to hit the targets 
used (NAc, CeM, BLA) with a probability of 0.9. Since there were six targets in each 
brain (three in each hemisphere—two injections and one optical fibre), the prob- 
ability of a successful surgery would be approximately (0.9)° ~ 0.5. We therefore 
performed about 24 surgeries in each group. All sample sizes mentioned in figures 
represent biological replicates. All animals receiving control virus injections were 
pooled into one control group (Figs 2 and 3). In the Venus group from Fig. 2c, 
there were six animals with injections in NAc and six animals with injections in 
CeM. In the Venus group from Fig. 2e, there were four animals with injection in 
NAc and two animals with injection in CeM. In the eYFP group from Fig. 3b, there 
were ten animals with injections in NAc and six animals with injections in CeM. In 
the eYFP group from Fig. 3c, there were nine animals with injections in NAc and 
five animals with injections in CeM. 

Replication. Results from AMPAR/NMDAR ratio experiments were replicated 
once with a different experimenter and the final numbers reported in the paper are 
pooled across both repetitions of the experiment (Fig. 1). Photostimulation and 
photoinhibition experiments (Figs 2 and 3) were not replicated. The RNA-seq 
experiment was also replicated once (Fig. 4). 

Randomization. All surgical and behavioural manipulations performed on each 
animal were determined randomly. All randomization was performed by an 
experimenter, and no explicit randomization algorithm was used. For animals 
used in photostimulation and photoinhibition experiments, the virus used in each 
animal (ChR2/Venus or NpHR/eYFP) and injection site (NAc/CeM) were deter- 
mined randomly and the stereotaxic apparatus used for surgery was counterba- 
lanced across groups. For surgeries with unilateral injections and/or fibre 
placements, the hemisphere used for injections was determined randomly during 
the time of the surgery. Surgeries were performed on animals caged in groups of 4 
or 5 animals. Animals from each cage were allocated to at least two behavioural 
groups. All animals used in ex vivo electrophysiology experiments were isolated at 
least 1 day before behavioural conditioning. 

Exclusion criteria. Ex-vivo electrophysiology. Data were excluded based on pre- 
determined histological and electrophysiological criteria, established during pilot 
experiments. The injection site was determined as the most ventral point where 
fluorescence was brightest, and data from cells where the corresponding retrobead 
injection was outside the target region (NAc or CeM) were excluded (Extended 
Data Fig. la-d). Each recorded cell was confirmed to be a projector by overlaying 
the fluorescence from retrobeads with the fluorescence from Alexa Fluor dye 
contained in the pipette. Each cell was also confirmed to be in the BLA by visu- 
alizing with differential interference contrast microscopy under a 4X objective. 
We did a secondary confirmation under the confocal microscope for cells that 
were recovered from streptavidin staining. Cells in which evoked responses were 
polysynaptic (multiple peaks in the evoked current) were discarded. Data from 
cells whose access resistance was greater than 40 MQ or cells that died during 
recording were also excluded. 

Animals used in photostimulation and photoinhibition experiments. Data from 
animals used in photostimulation and photoinhibition experiments (Figs 2 and 
3) were excluded based on histological and performance criteria established during 
pilot experiments. Histological criteria included injection sites and optical fibre 
placement (Extended Data Figs 4 and 5). Only animals with injection sites in the 
region of interest (NAc or CeM) were included. For animals with a rabies virus 
injection in CeM, atlas outlines were overlaid manually over a confocal image of 
the BLA containing the damage caused by the tip of the optic fibre. Light cones 
based on numerical aperture of the optic fibre (NA 0.37, ~15° half angle) were 
then drawn below the optic fibre and animals in which light cones encompassed 
central amygdala were excluded from further analysis. For the optical self-stimu- 
lation experiment, data from animals that did not respond at least 40 times (sum of 
nose pokes in active and inactive ports) over the 2 hour period were excluded from 
further analysis. For photoinhibition experiments, the amount of expression in 
each hemisphere of the BLA was rated on a scale of 0-5 based on fluorescence 
intensity by an experimenter blind to the behavioural performance of the animal. 
These ratings were gathered in an excel sheet, read by a MATLAB script and only 
data from animals with fluorescence ratings greater than 4 in each hemisphere 
were included for further analysis. 
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Extended Data Figure 1 | Histological verification of retrobead injection 
sites and behavioural quantification of fear and reward conditioning for 
mice used in Fig. 1. a, Representative differential interference contrast (DIC) 
image of a 300-j1m thick coronal slice containing the centre of the retrobead 
injection in NAc . The white circle indicates the most ventral point at which 
fluorescence is brightest and corresponds to the filled green circle in 

b. b, Location of all retrobead injection sites (green circles) in the NAc for all 
mice used in Fig. 1. Each atlas schematic represents a 1.5 mm X 1.5 mm region 
of the atlas and the corresponding anteroposterior stereotaxic coordinate 
relative to Bregma is indicated below. c, Representative DIC image of a 300-um 
thick coronal slice containing the centre of the retrobead injection in CeM as 
indicated by the white dot. d, Retrobead injection sites in CeM (red circles) for 
all mice used in Fig. 1, with the example from c indicated by the filled red circle. 
The corresponding anteroposterior stereotaxic coordinate relative to Bregma is 
indicated below. e, Experimental design for AMPAR/NMDAR ratios from 
Fig. 1. Either red or green retrobeads were injected in the NAc and the other 
colour in the contralateral CeM. Two weeks after injection, the retrobeads had 
travelled back to the cell bodies of the BLA neurons projecting to NAc or CeM. 
Animals were conditioned 1 day before ex vivo whole-cell patch-clamp 
recordings. Each mouse received one of six conditioning protocols, three 
protocols categorized under ‘fear conditioning’ and three protocols categorized 
under ‘reward conditioning’. Fear conditioning protocols: (i) naive, animals 
were naive to the operant chamber. (ii) Unpaired, animals were exposed to the 
conditioning chamber in two sessions. Animals received six tones in the first 
session and they received six foot shocks in the second session. Animals were 
returned to their home cage for ~20 min between the two sessions. (iii) Paired, 
animals were exposed to the operant chamber in two sessions. Animals did not 
receive any tone or shock stimuli in the first session, and received tones co- 
terminating with shocks in the second session. Animals were returned to their 
home cage for 20 min between the two sessions. Protocols for unpaired and 
paired fear groups were adapted from ref. 18. Reward conditioning protocols: 
(i) naive food restricted (FR), animals naive to the operant chamber were food 
restricted two days before ex vivo experiments and had free access to food for 1 
day before ex vivo experiments. We used this group to control for changes in 
synaptic strength caused by food restriction which was necessary in reward 
conditioning groups to expedite task acquisition, adapted from rats as in refs 1 
and 2. (ii) Unpaired, animals received tones in the operant chamber, were 


returned to their home cage for ~20 min after which they had free access to 1.8 ml 
of sucrose, followed by free access to food until ex vivo experiments. (iii) Paired, 
sucrose was delivered into a port 1 s after the onset of a tone, and the tone was 
terminated 400 ms after the animal entered the port to claim sucrose. The tone 
lasted for a maximum length of 30 s. If there was sucrose in the port during the 
onset ofa tone (indicated by the absence of a port entry after the previous tone), 
then no sucrose was delivered in that trial. Mice could receive up to 120 sucrose 
deliveries and the conditioning session lasted about 4 h after which they had 
free access to food until ex vivo experiments. Behavioural performance from the 
second half of the conditioning session was used to assess performance and 
mice that met learning criterion (see Methods) were categorized in the learner 
group and the rest of the mice were categorized in the non-learner group. One 
day after conditioning, BLA neurons identified as either NAc or CeM 
projectors (retrobead positive) were recorded with whole-cell patch-clamp in 
ex vivo brain slices. Ex vivo data from both NAc and CeM projectors were 
collected from the following 7 groups: (1) naive (n = 12 mice, 9 for NAc pr., 7 
for CeM pr.); (2) unpaired fear (n = 13 mice, 7 for NAc pr., 9 for CeM pr.); (3) 
paired fear (n = 10 mice, 7 for NAc pr., 7 for CeM pr.); (4) naive food restricted 
(n = 11 mice, 7 for NAc pr., n=8 for CeM pr.); (5) unpaired (n = 11 mice, 7 for 
NAc pr., 5 for CeM pr.); (6) reward paired non-learner (n = 9 mice, 5 for NAc 
pr., 6 for CeM pr.); and (7) reward paired learner (n = 10 mice, 8 for NAc pr.,7 
for CeM pr.) groups. The n values indicated here are the number of mice used in 
Fig. 1. Data from groups 1-5 and 7 are shown in Fig. 1 and Extended Data Figs 
1, 2 and 3. Data from group 6 is shown only in Extended Data Fig. 3. f, Time 
course of percentage freezing for the paired fear group. Percentage freezing was 
estimated during the shock-predictive tone (excluding the final 2 s, where the 
foot shock was delivered). g, Average normalized histogram of port entries 
relative to the onset of the tone predicting sucrose delivery for mice that learned 
the conditioned-unconditioned stimulus association (learners, n = 11) and 
mice that did not (non-learners, n = 17; see Extended Data Fig. 3). Mice in the 
paired reward conditioning group were deemed learners if the number of port 
entries in the post-conditioned-stimulus period (1 to 8 s relative to conditioned 
stimulus onset, black line) were determined as significantly higher than the 
number of port entries in the pre-conditioned stimulus period (—8 to —1s 
relative to conditioned stimulus onset, grey line) using a one-sided Wilcoxon 
rank sum test (P < 0.001). 
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Extended Data Figure 2 | Location of BLA projectors recorded and analysed 
for each experimental group in Fig. 1. Top, representative DIC image 
showing the location of the stimulation electrode around a bundle of fibres of 
the internal capsule and a neuron recorded in the BLA (at the tip of the 
micropipette). The location of the recorded cell is indicated by an orange open 
circle. Scale bar, 200 Lum. Bottom, atlas schematics (1.5 mm X 1.5 mm) showing 


-1.58 mm 


-1.82 mm -2.18 mm 


BLA at various anteroposterior positions relative to Bregma. Each circle 
represents the location of a neuron from which the AMPAR/NMDAR ratio 
was acquired (Fig. 1). NAc projector locations are summarized in rows 1 and 2 
and CeM projector locations are summarized in rows 3 and 4. Colour of 

the circle represents the conditioning group of the animal from which the 
AMPAR/NMDAR ratio was acquired. 
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Extended Data Figure 3 | Paired-pulse ratio and AMPAR/NMDAR ratio in 
non-learners and food-restricted naive animals. a, Confocal image of a 
representative retrobead-positive neuron recorded in BLA after injection of 
retrobeads into NAc. This cell was recorded in an ex vivo slice, filled with 
biocytin and stained with streptavidin-CF405, pseudo-coloured white. b, In 
NAc projectors, the ratio of EPSC amplitude in response to paired-pulse 
stimulation (50 ms inter-pulse interval) of internal capsule inputs to the BLA 
was not related to experimental conditions of fear (one-way ANOVA, Fo,44 = 
0.5209, P = 0.5978). c, Paired-pulse ratio of EPSC amplitude was not related to 
experimental conditions of reward (one-way ANOVA, F361 = 0.5868, P = 
0.6261). d, AMPAR/NMDAR ratio of internal capsule inputs onto NAc 
projectors in mice with unpaired tone and sucrose presentations (unpaired) 
and mice that did not learn the cue-reward association (non-learner) were not 
different (unpaired t-test t), = 0.180, P = 0.8595). Both groups of mice received 
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the same amount of total sucrose. e, AMPAR/NMDAR ratio on NAc projectors 
is significantly decreased by food restriction in naive mice (unpaired t-test, 
too = 2.626, P = 0.0162). f, Confocal image of a representative retrobead- 
positive neuron recorded in BLA after retrobead injection in CeM. g, Paired- 
pulse ratio of EPSC amplitude onto CeM projectors is not related to 
experimental conditions of fear (one-way ANOVA, F229 = 0.9040, P = 
0.4169). h, Paired-pulse ratio of EPSC amplitude is not related to experimental 
conditions of reward (one-way ANOVA, F3,44 = 0.9770, P = 0.4129). 

i, AMPAR/NMDAR ratio on CeM projectors is similar in unpaired reward and 
paired reward non-learner mice (unpaired t-test t,4 = 0.381, P = 0.7090). 

j, AMPAR/NMDAR ratio of internal capsule inputs onto CeM projectors 

is significantly increased by food restriction in naive mice (unpaired t-test 

typ = 2.526, P = 0.0201). Results show mean and s.e.m. 
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Extended Data Figure 4 | Histological verification of viral injection site and 
fibre placement for photostimulation experiments used in Fig. 2. a, Center 
of the rabies virus injection in NAc for the animals tested in intra-cranial 
self-stimulation (ICSS) and real-time place avoidance (RTPA) paradigms 
(Fig. 2a-e). Rabies virus (RV)-ChR2-Venus injections are denoted with green 
circles, and RV-Venus injections are indicated with grey squares. 

b, Representative confocal image of viral expression in a mouse 6 days after 
RV-ChR2-Venus injection in NAc. Right panel, enlarged view of the brightest 
fluorescence point (white circle), corresponding to the filled green circle in 

a. c, Center of RV-ChR2-Venus (red diamonds) and RV- Venus (grey squares) 
injections in CeM of animals analysed in Fig. 2. d, Example of viral expression 
6 days after RV-ChR2-Venus injection in CeM. Right panel, enlarged view of 
the brightest fluorescence point (white circle), corresponding to the filled red 


diamond in c. e, Optical fibre tip placements over BLA of animals with RV- 
ChR2-Venus injected in NAc (green circles), CeM (red diamonds) or RV- 
Venus in NAc or CeM (grey squares). Horizontal lines represent the thickness 
of the implanted fibre (300 jum). f, Representative confocal image showing 
optical fibre tip from a RV-ChR2-Venus injection in NAc, corresponding to 
the filled green circle in e. Region in the white rectangle is magnified in the right 
panel and shows rabies-virus-expressing NAc projectors. g, Representative 
optic fibre placement for RV-ChR2-Venus injection in CeM, corresponding to 
the filled diamond in (e). Right panel: enlarged image of the BLA, containing 
rabies-virus-expressing CeM projectors. Atlas schematic in a, c and e represent 
1.5mm X 1.5 mm of the brain and the corresponding anteroposterior 
coordinates relative to Bregma are specified below. Scale bars in b, d, f and 

g are 500 um. 
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Extended Data Figure 5 | Histological verification of viral injection site and 
fibre placement for photoinhibition experiments used in Fig. 3. a, Centre of 
canine adenovirus (CAV)-Cre injection into bilateral NAc of mice with AAV;- 
EFla-DIO-NpHR-eYFP (green circles) or AAV5-EF1a%-DIO-eYFP (grey 
squares) injected bilaterally into the BLA. This approach allows for selective 
expression of NpHR-eYFP/eYFP, in NAc-projecting BLA neurons. 

b, Representative confocal image of the CAV-Cre injection site in NAc. 

c, Center of CAV-Cre injection into CeM from both hemispheres of mice with 
AAV;-EFla-DIO-NpHR-eYFP (red diamonds) or AAV;-EF1a%-DIO-eYFP 
(grey squares) injected bilaterally into BLA. In these animals, CeM-projecting 
BLA neurons express NpHR-eYFP or eYFP, respectively. d, Confocal image of 


a representative CeM injection and NpPHR-eYFP-expressing cells bodies in the 
BLA. e, Optical fibre tip placements over BLA from both hemispheres in 
animals injected with AAV;-EFla-DIO-NpHR-eYFP in BLA and CAV-Cre in 
NAc (green circles) or CeM (red diamonds), or AAV5-EF1o-DIO-eYFP in BLA 
and CAV-Cre in NAc/CeM (grey squares). Horizontal lines represent thickness 
of the implanted fibre (300 jim). f, Representative confocal images of optic fibre 
placements over BLA from both hemispheres of an animal injected with CAV- 
Cre in NAc and AAV;-EF1a-DIO-NpHR-eYFP in BLA. Note NpPHR-eYFP- 
expressing NAc projectors in the BLA. Each atlas diagram and confocal image 
in a-f represents an area of 1.5 mm X 1.5 mm; anteroposterior stereotaxic 
coordinates relative to Bregma are specified to the left of each image. 
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Extended Data Figure 6 | Tone-evoked freezing behaviour following b, Time course of percentage freezing in mice expressing NpHR in NAc 
inhibition of CeM or NAc projectors during auditory fear conditioning. projectors (green circles), CeM projectors (red diamonds), or expressing eY FP 
a, Experimental design. Mice were trained in an auditory fear conditioning in NAc or CeM projectors (grey squares) was quantified for each trial. c, There 
paradigm, during which NAc or CeM projectors were selectively inhibited was no significant difference in freezing behaviour in response to the 
using a dual virus recombination approach (Fig. 3). On the day following conditioned stimulus among the three groups of mice on test day (one-way 
conditioning, mice were exposed to eight presentations of the conditioned ANOVA, Fy 3s = 2.010, P = 0.1488). Results show mean and s.e.m. 


stimulus alone. They were tethered to a patch cable but no light was delivered. 
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Extended Data Figure 7 | Membrane properties of retrobead-positive NAc/ 
CeM-projecting BLA neurons and rabies-virus-expressing BLA neurons. 
a, Access resistance, membrane resistance, and membrane capacitance were 
estimated from the current response of the cell to a 4 mV square voltage pulse 
using the Q-method**. Access and membrane resistance as well as the 
membrane capacitance and membrane potential were not significantly 
different between the two populations (unpaired t-tests: ta) = 0.788, P = 
0.4400; try = 1.599, P = 0.1256; toy = 1.847, P = 0.0796; and tyg = 0.2521, P= 
0.8038, respectively). The holding current corresponds to the current injected 
to clamp the cell at —70 mV. This value was not significantly different between 
NAc and CeM projectors (unpaired t-test, ty) = 1.046, P = 0.3079). b, Confocal 
image of a BLA-NAc projectors expressing ChR2-eYFP transduced by rabies 
virus (RV) and recorded ex vivo in whole-cell patch-clamp. The cell was filled 
with biocytin during recording and stained with streptavidin-CF405 (in grey). 
c, Current response to a 1-s blue light pulse in a cell expressing rabies virus, 5 
days after injection. d, Five days after viral injection, rabies-virus-expressing 


cells were able to respond with an action potential to every pulse of a 20 Hz light 
stimulation (5 ms pulses, top trace, blue line shows onset of light pulse). They 
also responded with an action potential to 250 pA, 5 ms current pulses injected 
at 20 Hz (middle trace). Rabies-virus-expressing cells also showed spontaneous 
post-synaptic excitatory and inhibitory currents (EPSCs and IPSCs, 
respectively) when clamped at —70 mV (bottom trace, 0 pA holding for this 
cell). e, Current/voltage curves are similar in retrobead (RB, grey circles, n = 5 
cells) and rabies-virus-expressing cells (black circles, n = 3 cells). f, Average 
action potential for 11 retrobead-positive BLA-NAc projectors (grey) and six 
BLA-NAc projectors expressing rabies virus. g, Membrane properties of 
retrobead-positive versus rabies-virus-expressing neurons. None of the 
properties investigated were significantly altered in rabies-virus-expressing 
neurons (unpaired t-tests: access resistance, ty; = 1.299, P = 0.2135; membrane 
resistance, t)5 = 2.057, P = 0.0575; membrane capacitance, ty; = 1.215, P = 
0.2430; action potential threshold, t;; = 0.0756, P = 0.9407; holding current, 
tye = 1.002, P = 0.3314). Results show mean and s.e.m. 
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Extended Data Figure 8 | Morphological reconstructions of individual BLA 
neurons projecting to NAc or CeM. Morphological reconstructions of all 
neurons used for Sholl analysis performed by Imaris software (Fig. 4i). 


Classification of each neuron as pyramidal or stellate is indicated in the top left 
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corner of each reconstructed neuron (triangle or star, respectively). Each atlas 
schematic represents 1.5 mm X 1.5 mm area and the corresponding 
anteroposterior stereotaxic coordinates (relative to Bregma) are shown below. 
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Extended Data Figure 9 | RNA-seq identification of candidate genes 
differentially expressed in NAc- and CeM-projecting BLA neurons. 

a, Candidate differentially expressed genes were required to be enriched in only 
one group (either CeM or NAc projectors) in two independent experiments 
(NAc projectors collected from n = 8 mice; CeM projectors collected from n = 
9 mice, total) at the indicated quantile fold-change threshold (light-blue 
column). One of the chance estimates (‘flip-flopped’, see Methods) is taken 
from genes that passed the quantile thresholds but were enriched in the 
opposite groups in the two experiments. Another chance estimate (‘permuted’, 
see Methods) is determined based on an analysis in which fold differences for 
each gene were permuted across genes within each of the two experiments 
before determining differential expression. A 0.02 quantile threshold was 
chosen to identify differentially expressed candidate genes in order to balance 
specificity and sensitivity, resulting in an estimated false discovery rate of 
41.5%, calculated as the number expected by chance (flip-flopped) divided by 
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the number of differentially expressed genes (see Extended Data Fig. 9c for 
candidate gene list). In Fig. 4k, a 0.01 quantile threshold was chosen to identify a 
more conservative list of differentially expressed candidate genes at a lower false 
discovery rate of 26.2%. b, Distribution of differentially expressed genes 
between NAc and CeM projectors from RNA-seq experiments 1 and 2 (see 
Methods). Light-blue shaded areas represent the 2nd and 98th percentiles of the 
distributions. c, RNA-seq heat map showing normalized expression levels of 
differentially expressed genes in NAc- and CeM-projecting BLA neurons. 
Differentially expressed genes were required to be enriched in either NAc or 
CeM projectors in two independent experiments (samples used in experiment 1 
are indicated in black text below the heat map; experiment 2 samples are 
indicated in blue text) at a 0.02 quantile threshold (Extended Data Fig. 9a). Each 
RNA-seq library was prepared from 35-60 manually sorted retrobead-labelled 
cells taken from the BLA. 
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NIK1-mediated translation suppression functions as 
a plant antiviral immunity mechanism 


Cristiane Zorzatto’*, Joio Paulo B. Machado2*, Kénia V. G. Lopes!?, Kelly J. T. Nascimento”, Welison A. Pereira’, 
OtavioJ. B. Brustolini’?, Pedro A. B. Reis’, IaraP. Calil!?, Michihito Deguchi’, Gilberto Sachetto-Martins””, BiancaC. Gouveia’, 
Virgilio A. P. Loriato!?, Marcos A. C. Silva”, Fabyano F. Silva*, Anésia A. Santos”, Joanne Chory”* & Elizabeth P. B. Fontes)? 


Plants and plant pathogens are subject to continuous co-evolution- 
ary pressure for dominance, and the outcomes of these interactions 
can substantially impact agriculture and food security’ *. In virus- 
plant interactions, one of the major mechanisms for plant antiviral 
immunity relies on RNA silencing, which is often suppressed by 
co-evolving virus suppressors, thus enhancing viral pathogenicity 
in susceptible hosts’. In addition, plants use the nucleotide-binding 
and leucine-rich repeat (NB-LRR) domain-containing resistance 
proteins, which recognize viral effectors to activate effector-triggered 
immunity in a defence mechanism similar to that employed in non- 
viral infections~*. Unlike most eukaryotic organisms, plants are not 
known to activate mechanisms of host global translation suppres- 
sion to fight viruses’”. Here we demonstrate in Arabidopsis that the 
constitutive activation of NIK1, a leucine-rich repeat receptor-like 
kinase (LRR-RLK) identified as a virulence target of the begomovirus 
nuclear shuttle protein (NSP)**, leads to global translation suppres- 
sion and translocation of the downstream component RPL10 to the 
nucleus, where it interacts with a newly identified MYB-like protein, 
L10-INTERACTING MYB DOMAIN-CONTAINING PROTEIN 
(LIMYB), to downregulate translational machinery genes fully. 
LIMYB overexpression represses ribosomal protein genes at the tran- 
scriptional level, resulting in protein synthesis inhibition, decreased 
viral messenger RNA association with polysome fractions and enhanced 
tolerance to begomovirus. By contrast, the loss of LIMYB function 
releases the repression of translation-related genes and increases 
susceptibility to virus infection. Therefore, LIMYB links immune 
receptor LRR-RLK activation to global translation suppression as an 
antiviral immunity strategy in plants. 

NIK1 was first identified as a virulence target of the begomovirus 
NSP**. For begomoviruses, a group of single-stranded DNA viruses 
that infect major crops, the success of infection relies not only on viral 
suppressors of RNA silencing* but also on the viral inhibitor, NSP, of 
the immune receptor, NIK1 (ref. 5). The NIK1 protein belongs to the 
same LRR-RLK subfamily as the well-characterized PAMP recognition 
co-receptor BRI1-ASSOCIATED RECEPTOR KINASE 1 (BAK1)”*. 
NIK1 is involved in plant antiviral immunity’, whereas BAK] is required 
for plant immunity against bacteria, fungi and oomycetes through its 
interactions with multiple PAMP-recognition LRR-RLKs’. We have 
previously demonstrated that the activation of NIK] kinase is induced 
by the phosphorylation of Thr 474 within the activation (A)-loop'®*” 
(Supplementary Discussion 1). Apart from the identification of RPL10 
as a downstream effector in NIK1-mediated antiviral immunity”, 
mechanistic knowledge of the signalling pathway is lacking, and the 
molecular nature of the defence response remains unclear. In this study, 
we replaced the normal NIK1 receptor with the NIK1 phosphomimetic 
gain-of-function mutant T474D"' in transgenic Arabidopsis lines to un- 
derstand the molecular basis of the NIK1-mediated defence mechanism 


(Extended Data Fig. la-c). Transgenic lines possessing the gain-of- 
function mutant T474D in the nik1 knockout background”? were chal- 
lenged with infectious clones of the Arabidopsis-infecting begomovirus 
cabbage leaf curl virus (CaLCuV)’*". Then, we compared the virus- 
induced and T474D-induced transcriptomes at 10 days post-inoculation 
(dpi). A global cluster analysis of the expressed sequences among the 
mock-treated and infected wild-type (Col-0), NIK1 and T474D lines 
(Supplementary Table 1) revealed that the transcriptomes of the in- 
fected wild-type and mock-inoculated T474D lines were most closely 
related; these samples clustered together with a high bootstrap proba- 
bility and a high approximately unbiased P value (Fig. 1a), which sug- 
gests that the NIK1-mediated response and the response to begomovirus 
infection share similar mechanisms. These transcriptomes differed greatly 
from the NIK1 mock-inoculated transcriptome, indicating that virus 
infection activates the NIK1-mediated response. Moreover, the gain- 
of-function T474D mutant might be activated in a constitutive manner 
that allows it to support a sustained NIK1-mediated response, in con- 
trast with the expression of the intact NIK1 receptor in the nik1 genetic 
background. The transcriptome from NIK1-complemented lines clus- 
tered with the Col-0 mock-inoculated transcriptome. 

We also employed these transgenic lines to assess the T474D- 
induced global variation in gene expression. Gene enrichment analyses 
of immune system category genes indicated that ectopic expression of 
T474D did not activate typical viral defences, such as salicylic acid sig- 
nalling or virus-induced gene silencing (Supplementary Table 2, Extended 
Data Fig. 2a, b and Supplementary Discussion 2). Among the differenti- 
ally expressed genes, we observed an overrepresentation of translational- 
machinery-related genes, which largely predominated the downregulated 
gene list (Extended Data Fig. 3a, red spots; Supplementary Tables 2 and 3). 
Using enrichment analysis, these downregulated genes included ribo- 
somal genes and other components of the protein synthesis machinery. 
Therefore, T474D ectopic expression downregulates components of the 
translational machinery, suggesting that the constitutive activation of 
NIK1 might influence translation. To confirm that protein synthesis 
was impaired by constitutive activation of NIK] in the T474D lines, we 
labelled leaf proteins in vivo with [°°S]Met in the control and nik1 
plants, as well as in the NIK1- (ref. 11), G473V/T474A- (ref. 11) and 
T474D-expressing lines (Fig. 1b and Extended Data Fig. 3b). The one- 
proportion statistical test indicated a significant decrease (12.8% for 
T474D line 4 (T474D-4) and 13% for T474D-6; P < 0.05) in the amount 
of newly synthesized protein in T474D-expressing leaves compared 
with wild-type and NIK1-expressing leaves. We also demonstrated that 
dexamethasone-inducible T474D expression for 8h led to a higher 
inhibition of de novo protein synthesis in the transgenic lines (Fig. 1c 
and Extended Data Fig. 3c-e). In the dexamethasone-inducible lines, 
the expression of T474D significantly reduced polysome (PS) and mono- 
some (NPS) fractions (12% total reduction) to a similar extent as it 
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Figure 1 | Constitutive activation of the NIK1 receptor suppresses global 
host translation and confers tolerance to begomovirus. a, Ward hierarchical 
clustering of the gene expression data from the Arabidopsis infection 
experiments at 10 dpi. The dendrogram provides two types of P values: 
approximately unbiased (AU; red) and bootstrap probability (BP; green). inf, 
infected plants; mock, mock-inoculated plants. b, Global translation 
suppression by T474D expression. In vivo labelling of leaf proteins with 
[°°S]Met was performed in Col-0 and transgenic seedlings. The labelling 
percentage was also normalized to the leaf chlorophyll content (Extended Data 
Fig. 1d). Estimated proportion (in %) with respective error bars showing 
standard error of the mean (s.e.m.) from three independent experiments 

(n = 3). Asterisks indicate that the proportion is significantly (P < 0.05) 
different from 100% using the one-sided (less) chi-squared test. GV/TA, 
G473V/T474A double-mutant lines. c, Inhibition of global translation upon 
induction of T474D expression. An oestrogen-receptor-based chemical- 
inducible system was used to induce T474D expression in Arabidopsis 
seedlings, which were treated with 10 1M dexamethasone (Dex) or ethanol 
(Eth) for 8 h, and then were pulse labelled with L- [°°S]Met for 1h. Tricarboxylic 
acid (TCA)-precipitable radioactivity in each sample was normalized to 
Col-0 + ethanol. Estimated proportion with respective error bars showing 
s.e.m. from three independent experiments (n = 3). Asterisks indicate that the 
proportion is significantly (*P < 0.05) different from 1 using the one-sided 
(less) chi-squared test. TR1 and TR2 are independently transformed lines 
harbouring T474D under the control of the dexamethasone-inducible 
promoter. d, Quantitation of total and PS-associated viral replication initiator 
protein (Rep) and host transcripts by qRT-PCR. The two-way analysis of 
variance (ANOVA) results were sliced by RNA fractions (total and PS). 
Different letters indicate significant differences among the RNA levels of the 
same gene by the TukeyHSD test (P < 0.01). mRNA/actin, mRNA of the gene 
normalized to actin mRNA. e, Symptoms associated with CaLCuV infection 
in wild-type and transgenic lines at 21 dpi. The figure shows representative 
samples from three independent experiments, each one with ten plants 
(biological replicates). f, Absolute quantitation of CaLCuV genomic units in 
infected lines at 14 dpi. An 18S ribosomal RNA (rRNA) target was run in 
parallel for normalizing the template load per reaction. Error bars, 95% 
confidence intervals based on bootstrap resampling replicates of four 
independent (n = 4) experiments. g, PS loading of viral mRNA in systemically 
infected leaves. PS-bound RNA was probed with viral Rep complementary 
DNA and 18S rDNA. 
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reduced PS and NPS RNA (13% reduction; Extended Data Fig. 4a-c). 
The loading of host mRNA (RBCS, Arabidopsis thaliana (At)WWP1, 
S13 and S39 genes) in actively translating PS fractions was significantly 
reduced in T474D-overexpressing lines compared to the wild-type line, 
although to a different extent (Fig. 1d, Extended Data Fig. 4d and Sup- 
plementary Discussion 3). Therefore, the activation of NIK1 reduces 
global levels of translation, but the effect may not be the same for all 
mRNAs. This downregulation of cytosolic translation might at least 
partially underlie the molecular mechanisms involved in NIK1-mediated 
antiviral defences. 

To examine whether the constitutive activation of NIK1 was effective 
at controlling begomovirus infection, the transgenic lines were inocu- 
lated with CaLCuV DNA-A and DNA-B. The wild-type plants dis- 
played typical symptoms of CaLCuV infection, whereas the symptoms 
in the T474D-expressing lines were greatly attenuated (Fig. le and 
Extended Data Fig. 5b). The symptomless CaLCuV infections of the 
T474D-expressing lines were associated with a delayed course of infec- 
tion (Extended Data Fig. 5a) and a lower accumulation of viral DNA in 
the systemically infected leaves (Fig. 1f). 

Because T474D expression caused downregulation of protein syn- 
thesis, we determined whether viral RNA translation was impaired in 
the T474D-expressing lines. We examined viral RNA transcripts in ac- 
tively translating PS fractions prepared from infected leaves at 10 dpi 
(Extended Data Fig. 4e-g), when the total viral mRNA accumulation 
in the Col-0 and T474D-expressing lines was not very dissimilar (Fig. 1d 
and Extended Data Fig. 5c). We observed a significant reduction in the 
PS loading of viral mRNA in the T474D infected leaves compared with 
the wild-type infected leaves (Fig. 1d, g), indicating that the begomo- 
virus was not capable of sustaining high levels of viral mRNA trans- 
lation in the T474D-expressing lines. 

Our data indicate that the translational suppression induced by 
NIKI constitutive activation is associated with the downregulation 
of translational-machinery-related genes. Hence, NIK1-mediated 
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Figure 2 | LIMYPB interacts with RPL10 in the nucleus. a, Jn vivo interaction 
between LIMYB and RPL10 by bimolecular fluorescence complementation 
(BiFC) analysis. The fluorescence (yellow fluorescent protein (YFP)) images 
were acquired using Nicotiana tabacum leaves co-expressing the 35S::RPL10- 
SPYNE + 35S::LIMYB-SPYCE and 35S::RPL10-SPYCE + 35S::LIMYB- 
SPYNE fusion proteins. They are representative samples from three 
independent biological repeats. Scale bars, 20 um. DIC, differential interference 
contrast. b, LIMYB and RPL10 interaction in planta. The indicated constructs 
were expressed in N. tabacum leaves, and co-immunoprecipitation (Co-IP) 
assays were performed using an anti-haemagglutinin (HA) antibody. CSN5A is 
an unrelated protein used as a negative control. c, Co-immunoprecipitation 
assay was performed using an anti-GFP antibody. 
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nucleocytoplasmic trafficking of RPL10 may be linked to the regulation 
of gene expression. The previously identified extraribosomal functions 
of RPL10 associated with transcription factor regulation’* '° may serve 
as potential targets for assessing this hypothesis. We used the two-hybrid 
system to search for RPL10 nuclear partners and isolated a MYB- 
domain-containing transcription factor, which was designated as LIMYB 
(Extended Data Fig. 6a-e). As a putative transcription factor, LIMYB 
localized in the nucleus of transiently or stably transformed plant cells 
(Extended Data Fig. 7a-i) and interacted with RPL1O in the nuclei of 
plant cells (Fig. 2a and Extended Data Fig. 6f, g). We further demonstrated 
that RPL10 and LIMYB interact in vivo using co-immunoprecipitation 
assays (Fig. 2b, c). 

The function of LIMYB in NIK1-mediated antiviral signalling was 
examined using several different approaches. We first demonstrated that 
LIMYB, RPL10 and NIK] are co-expressed in several organs (Extended 
Data Fig. 8). Then, we identified transfer DNA (T-DNA) insertion mu- 
tants (limyb-32 and limyb-82) in the LIMYB gene (Extended Data Fig. le, f). 
Wealso prepared LIMYB-overexpressing lines (Extended Data Fig. 1g) 
and determined the LIMYB-induced global variation in gene expression 
compared to Col-0 leaves. Remarkably, the overexpression of LIMYB 
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resulted in a downregulation of translational-machinery-related genes 
similar to that induced by T474D expression (Extended Data Fig. 1h, 
red spots). We selected five ribosomal protein (RP) genes to confirm 
the deep-sequencing results for the LIMYB-overexpressing lines by quan- 
titative polymerase chain reaction with reverse transcription (qRT-PCR) 
(Fig. 3a). LIMYB overexpression downregulated the expression of the 
selected RP genes but not of the unrelated gene (AtWWPI1; Extended 
Data Fig. 9c). Conversely, in the limyb-32 line, the RP genes were up- 
regulated (Fig. 3 and Extended Data Fig. 9a, d). Expression of LIMYB 
in limyb lines restored the wild-type expression of the RP genes (Ex- 
tended Data Fig. 9b). As in the T474D-expressing lines, protein synthe- 
sis was slightly but significantly reduced in the LIM YB-overexpressing 
lines (Fig. 3c and Extended Data Fig. 4h). Because RPL10 functions 
in NIK1-mediated antiviral signalling and interacts with LIMYB, we 
examined whether RPL10 also controls the expression of RP genes. In 
RPL10-overexpressing lines (Extended Data Fig. 1i, j), the expression 
of RP genes but not of the unrelated AtW WP1 gene was downregulated 
(Fig. 3d and Extended Data Fig. 9e). We then examined whether LIMYB, 
a putative transcription factor, binds to an RP promoter (RPL18) in vivo. 
We performed chromatin immunoprecipitation (ChIP) experiments 
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Figure 3 | LIMYB links NIK] activation to the downregulation of RP genes 
and confers tolerance to begomovirus. a, LIMYB overexpression 
downregulates RP genes. RP gene expression was monitored by qRT-PCR of 
RNA from LIMYB-overexpressing leaves and Col-0. b, Induction of RP 
genes in the limyb-32 mutant, monitored as in a. c, LIMYB overexpression 
suppresses global host translation. Col-0 and LIMYB-overexpressing seedlings 
were labelled with [°°S]Met in the presence and absence of chloramphenicol, 
as described in Fig. 1b. d, RPL10 downregulates RP genes, monitored as in a. 
e, LIMYB binds to the RPL18e promoter in vivo. ChIP assay was performed 
with LIMYB-GFP-expressing leaves or wild type using anti-GFP antibodies. 
The co-immunoprecipitated (Co-IP) 150-bp fragment of the RPL18 promoter 
was detected by PCR. C, amplification of the control plasmid. f, ChIP-qPCR 
assay of 35S:LIMYB-GFP transgenic and wild-type (Col-0) seedlings. Samples 
were immunoprecipitated with anti-GFP antibody or IgG antibody. ChIP 
DNA was quantified with real-time PCR. g, LIMYB or RPL10 repress the 

RP promoter. N. benthamiana leaves were agro-infiltrated with plasmids 
carrying the prL28e-luciferase (pL28e), prS23A-luciferase (pS13A) or prUb10- 
luciferase (pUb10) genes in combination with either the 35S::LIMYB or 


35S:RPL10 DNA constructs. Luciferase activity was measured 48 h after agro- 
infiltration. F/R, firefly luciferase activity to Renilla activity ratio. h, LIMYB and 
RPL10 determine the full transcriptional repression of the RP promoter. 

N. benthamiana leaves were agro-infiltrated with the indicated combination of 
DNA constructs and luciferase activity was measured 48 h after infiltration. 

i, Downregulation of RP genes in T474D-expressing lines, monitored as in a. 
j, T474D requires the function of LIMYB to mediate RP gene suppression. 
The limyb-32 (limyb) mutant was transformed with 35S::T474D, and the 
expression of RP genes was monitored as described in a. k, Symptoms 
associated with CaLCuV infection in Col-0 plants, the LIMYB-overexpressing 
line and limyb mutants at 21 dpi. Representative samples from three 
independent experiments, each one with ten plants (biological replicates). 

1, Absolute quantitation of CaLCuV genomic units in infected LIMYB- 
overexpressing lines and Jimyb mutants. Viral accumulation was determined by 
quantitative PCR at 14 dpi, as described in Fig. 1f. a—d, f-j, 1, The respective 
95% confidence interval limits were estimated based on bootstrap resampling 
replicates of three independent (n = 3) experiments and two technical 
repeats, except that in g, n = 2 with two technical repeats. 
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in which a 150 base pair (bp) RPL18 promoter was amplified from the 
precipitated DNA of LIMYB-expressing tissues but not of wild-type 
tissues (Fig. 3e). ChIP-qPCR showed that the 150-bp promoter frag- 
ment was significantly enriched in samples precipitated by anti-green 
fluorescent protein (GFP) antibody but not in samples pulled down 
from wild-type lines (Fig. 3f). These results suggest that LIMYB may 
function as a DNA-binding protein that associates with RP promoters 
in vivo. 

To provide further evidence for the regulation of RP target genes by 
LIMYB and RPL10, we performed a luciferase transactivation assay 
in agro-infiltrated leaves. Consistent with the gene expression profile, 
LIMYB and RPLI1O specifically repressed the L28e, $13A and L18E 
promoters but not an unrelated ubiquitin promoter (Fig. 3g, h). The 
co-transfection of Arabidopsis protoplasts with both LIMYB and RPL10 
promoted increased repression of the L18E promoter compared to reg- 
ulation of the gene reporter by individual expression of the transfactors. 
Collectively, these results indicate that LIMYB and RPL10 function as 
transcriptional repressors of common RP genes and that both trans- 
factors are required for full regulation (Fig. 3h). Therefore, LIMYB and 
RPL10 may coordinately regulate common target promoters. 

Wealso demonstrated that T474D expression downregulated the RP 
genes (Fig. 3i) but not AtWWPI (Extended Data Fig. 9f), confirming 
the RNA-sequencing data (Extended Data Fig. 3a). As an additional 
control, we showed that the double-mutant NIK1 G4743V/T474A™, 
which does not complement the enhanced susceptibility phenotype of 
the nik1-null alleles (Extended Data Fig. 5a), also does not affect RP 
genes (Extended Data Fig. 9g). Nevertheless, the loss of LIMYB func- 
tion prevented the T474D-mediated downregulation of the RP genes in 
both the limyb-32 transgenic lines that stably expressed T474D (Fig. 3) 
and Extended Data Fig. 9h, i) and the limyb-32 protoplasts that transi- 
ently expressed T474D (Extended Data Fig. 10). These results genetically 
link LIMYB to the translation suppression portion of the NIK1-mediated 
antiviral response. 

We predicted that if the suppression of host translation was the basis 
for the begomovirus-tolerant phenotype of the T474D-expressing lines, 
LIMYB overexpression would also be effective against CaLCuV and the 
loss of LIMYB function would further debilitate the plant upon begom- 
ovirus infection. Col-0, limyb-32, limyb-82 and LIMYB-overexpressing 
lines were inoculated with infectious clones of CaLCuV DNA-A and 
DNA-B (Fig. 3k). Both Col-0 and limyb lines developed typical CaLCuV 
symptoms, although to different extents. The disease symptoms varied 
in severity from extreme stunting and leaf distortion with severe chlor- 
osis in the limyb-32 and limyb-82 leaves to mild leaf distortion and 
moderate chlorosis in Col-0. The course of infection in the limyb leaves 
was accelerated compared to that in the Col-0 plants (Extended Data 
Fig. 5d) and the limyb-32 and limyb-82 systemic leaves accumulated 
higher levels of viral DNA than Col-0 (Fig. 31). Therefore, the loss of 
LIMYB function recapitulated the enhanced begomovirus susceptibility 
phenotype of the nik1-null alleles, as would be expected from a down- 
stream component of the NIK1-mediated antiviral defence. By contrast, 
the LIMYB-overexpressing lines did not develop symptoms, displayed 
delayed infection and accumulated a lower level of viral DNA in their 
systemic leaves, resembling the tolerant phenotype of the T474D- 
overexpressing lines. We also found that LLMYB overexpression did 
not induce salicylic-acid-signalling marker genes or typical defence re- 
sponses to viral infection (Extended Data Fig. 2c and Supplementary 
Table 2). Collectively, these results further support the notion that the 
inhibition of host translation observed in the T474D and LIMYB lines 
may be an effective mechanism exploited by plant cells to fight bego- 
movirus infection. Therefore, the demonstration that immune-receptor- 
mediated defence signalling controls translation in plant cells may 
represent a new paradigm for antiviral defences in plants (Supplemen- 
tary Discussion 4). 
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Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
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METHODS 

Plasmid constructs. The clone pk7F-NIK1T474D was described previously". This 
clone harbours a GFP gene fused in frame after the last codon of the respective mu- 
tant cDNA under the control of the CaMV 35S promoter. In the mutant cDNA 
T474D, the threonine residue at position 474 within the activation loop of NIK1 was 
replaced with an aspartate residue. The clones rpL10AST-pDONR201 (pUFV608), 
rpL10ANS-pDONR201 (pUFV609), rpL10AST-pDONR207 (pUFV900) and 
rpL10ANS-pDONR207 (pUFV901), which harbour the RPL10A coding region ei- 
ther with (ST) or without (NS) a translational stop codon inserted into the entry 
vectors pDONR201 or pDONR207, have also been previously described’*. The 
RPLI1OA coding region was transferred from these entry clones to the yeast ex- 
pression vectors pDEST32 and pDEST22, generating the clones pBD-RPL10A 
(pUFV 1422) and pAD-RPL10 (pUFV785) as GAL4 binding domain (BD) or acti- 
vation domain (AD) fusions. For BiFC, the RPL10A coding region was transferred 
to the vectors SPY-NE-GW and SPY-CE-GW. The resulting clones, pSPYNE- 
RPL10A (pUFV1652) and pSPYCE-RPL1OA (pUFV1653), harboured the RPL10A 
coding region fused to either the N-terminal (NE) region of YFP or the YFP C 
terminus (CE), respectively, under the control of the 35S promoter. The clones 
pK7F-L10 and pYFP-L10, which harbour RPL10A fused in frame with GFP after 
its last codon or with YFP before its first codon under the control of the 35S pro- 
moter, respectively, have been previously described’®. 

The clone pYFP-NLS-L10, which harbours a YFP-L10 fusion with a nuclear 
localization signal (NLS) under the control of the 35S promoter, was constructed 
by first inserting the amplified coding region of RPL10A into the Sacl site of the 
pGR vector. The resulting clone, pGR-L10, which contains the RPL10A coding 
region fused to a glucocorticoid receptor (GR) domain with an NLS at the 5’ end, 
was then used as the template for the amplification of the NLS-containing L10 
fusion with specific primers. The resulting product was inserted by recombination 
into pDONR207 and transferred to 35S-Y FP-casseteA-Nos-pCAMBIA 1300, gen- 
erating pYFP-NLS-L10. 

For the induction of T474D expression, the mutant cDNA T474D was trans- 
ferred by recombination from the pDON201-T474D entry vector to the destina- 
tion vector pBAV150 (ref. 17). The resulting clone pBAV150-NIK1 T474D-GFP 
harbours the mutant T474D open reading frame (ORF) under the control of a 
dexamethasone-inducible promoter and fused to a C-terminal GFP tag. 
Arabidopsis growth conditions and transformation. The Columbia (Col-0) eco- 
type of A. thaliana was used as the wild-type control, and the nik1 knockout line’® 
was used for plant transformation. The nik1 lines'® were transformed with pK7F- 
NIK1T474D or pBAV150- NIK1 T474D-GFP using the floral dip method. Two 
independently transformed lines expressing the T474D transgene (T474D-4 and 
T474D-6) were selected for the infection assays. The transgenic lines ectopically 
expressing NIK1-GFP (NIK1-5 and NIK1-8) or an inactive kinase, the double mu- 
tant of NIK1, and the G473V/T474A-GFP dead kinase (G473V/T474A-10 and 
G473V/T474A-8) have been previously described'’” (Supplementary Table 1). 
RT-PCR and real-time RT-PCR analyses. Total RNA was extracted from Arabi- 
dopsis leaves using TRIzol (Invitrogen). RT-PCR assays were performed using gene- 
specific primers, as previously described'*. For gene expression analysis by RT-PCR, 
gene-specific primers were designed using Primer Express 3.0 (Life Technologies). 
Real-time RT-PCR assays were performed on an ABI7500 instrument (Life Tech- 
nologies) using the SYBR Green PCR Master Mix (Life Technologies). The amp- 
lification reactions were performed as follows: 2 min at 50 °C, 10 min at 95 °C and 
40 cycles of 94 °C for 15 s and 60 °C for 1 min. The variation in gene expression was 
quantified using the comparative Ct method (2~“4“), and absolute gene express- 
ion was quantified using the 24“ method. The values were normalized to endo- 
genous actin and ubiquitin as control genes’’. 

RNA sequencing and data analysis. The transgenic and wild-type lines were in- 
fected at the seven-leaf stage with CaLCuV, as described later. After 10 dpi or after 
21 dpi, total RNA from systemically infected leaves, as diagnosed by PCR, and mock- 
inoculated leaves from wild-type, 35S::NIK1-5 and 35S::T474D-4 lines was isolated 
using TRIzol (Invitrogen). For the RNA-sequencing experiments, we used two bio- 
logical replicates of a pool of ten plants at 10 dpi, when we detected high levels of 
viral DNA in systemic leaves but symptoms had not yet developed, or at 21 dpi, when 
symptoms were visible. The RNA-sequencing data were obtained using an Illumina 
Hi-seq 2000. The paired-end 100-bp protocol was used with the following quality 
filter parameters: 5 bases trimmed at the 3’ and 5’ ends of the reads, a minimum 
average quality of 30 phred score. The data were stored in a comma-separated value 
spreadsheet file, and differential gene expression (DGE) analysis was performed 
using the R/Bioconductor package edger’”. The raw data were normalized using 
the TMM normalization factor”®. The dispersion was estimated by the tagwise edger 
parameter. Differential expression was determined using the cut-off P value of 0.1 
adjusted by the false discovery rate (FDR). The read mapping process was executed 
using the Bowtie program”' with the cDNA data set retrieved from The Arabidopsis 
Information Resource (TAIR) database (http://www.arabidopsis.org), tenth release. 
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Gene ontology classification was performed using the R/Bioconductor packages 
GSEABase and GOstats. Clustering analysis was performed using the R package 
pvclust (Hierarchical Clustering with P-Values via Multiscale Bootstrap Resam- 
pling) using Ward’s method”, and heatmaps were generated using gplots. The re- 
sults were stored in a relational database created in PostgreSQL, and a web interface 
was created using PHP to allow the database to be accessed and navigated (http:// 
arabidopsisnik inctipp.ufv.br). 

In vivo labelling of leaf proteins. Arabidopsis seedlings (300 mg) were incubated 
with 1 ml of nutrient solution containing 30 pg ml~' dexamathosone, 10 1g ml 
cycloheximide, 10 pg ml~! puromycin or 25 pg ml“! chloramphenicol for 1h, 4h 
or 8h. After the incubation period, 20 1Ci of [°°S]Met (EasyTag Protein Labelling 
Mix, [°°S]-, 2 mCi (74 MBgq), Perkin Elmer) was added for 1 h at room temperature. 
To quantitate the incorporation of [*°S]Met into protein, aliquots of protein extracts 
were placed in 10% (w/v) TCA and incubated on ice for 30 min. The samples were 
filtered onto glass microfibre filters, and the filters were washed three times with 
5 ml cold 5% (w/v) TCA and two times with 5 ml 95% ethanol. After drying, the 
filters were counted with a scintillation counter. 

PS fractionation. PSs were fractionated from 500 mg of 15-day-old Arabidopsis 
seedlings over sucrose gradients as described’’. Fractions were collected manually 
from the top, and total RNA was extracted with phenol/chloroform/isoamyl alco- 
hol, precipitated with isopropanol, and treated with DNase I. The specific tran- 
scripts were examined by northern blot analysis. For the infection assays, PSs from 
wild type, NIK1-4-overexpressing and T474D-6-overexpressing lines were isolated 
10 dpi with infectious CaLCuV clones. 

PS gradients of dexamethasone-inducible T474D seedlings were prepared from 

8-day-old seedlings, treated or not with 30 pg ml’ dexamathosone. After 8h of 
dexamethasone treatment, the seedlings were dried, frozen and ground in liquid 
nitrogen. Cytoplasmic extracts were prepared from 350 mg of powder that were re- 
suspended in the ice-cold extraction buffer. Cell debris was removed by centrifu- 
gation and 0.6 ml of the supernadant were loaded onto a 4.5 ml 15-55% sucrose 
density gradient and separated by ultracentrifugation. Fractions (16) of 310 il were 
collected from the top. Fractions 2-7 correspond to density regions of complexes 
=80S (NPSs) and fractions 8-15 correspond to density regions containing small 
and large polysomes (PSs). Quantitation of NPSs and PSs from the absorbance pro- 
file was performed as described previously™*. The quantitation of total RNA from 
the combined NPS and PS fractions was performed as described previously”’. To 
quantify mRNA loading into PSs, total RNA isolated from combined NPS (2-7) 
and PS (9-15) fractions was extracted using Trizol (Invitrogen), precipitated with 
isopropanol and quantified by qRT-PCR, as described earlier. RT-PCR on each 
fraction from the PS gradient of dexamethasone-inducible T474D seedlings was 
also performed for quantitation of the distribution of AtW WP1 and S39 transcripts 
after induction of T474D expression. Each sample was amplified with 1 cycle at 
50°C for 2 min, 95 °C for 10 min and 40 cycles at 95 °C for 15 s and 60 °C for 1 min. 
Values were normalized to actin. 
CaLCuV inoculation and viral DNA accumulation. A. thaliana plants at the seven- 
leaf stage were inoculated with plasmids containing partial tandem repeats of CaLCuV 
DNA-A and DNA-B by biolistic delivery”’, and the course of infection was mon- 
itored as described previously’®. We used an attenuated form of the virus in which 
the coat protein ORF in the CaLCuV DNA-A was interrupted by the introduction 
of a stop codon at amino acid position 47. The inoculated plants were transferred 
to a growth chamber and examined for symptom development (leaf necrosis, chlo- 
rosis, leaf epinasty, leaf curly, young leaf death and stunted growth). Total nucleic 
acids were extracted from systemically infected leaves, and viral DNA was detected 
by PCR with DNA-B CaLCuV-specific primers (566CLCVBFBRIv, 5’-GGCGTG 
GGGTATCTTACTC-3’, and 1253CLCVBRBRIc, 5’-GACATAGCATCGGACA 
TCC-3’) as well as the actin-specific primers as an endogenous control. In each 
experiment, 20 plants of each line (Col-0, nik1 and nikl-expressing NIK1 mutant 
proteins) were inoculated with 2 1g of tandemly repeated DNA-A plus DNA-B per 
plant. The course of infection was examined using data from three independent 
experiments. 

Viral DNA accumulation was measured by qPCR. The reactions were prepared 
ina final volume of 10 jl using the Fast SYBR Green Master Mix (Life Technologies) 
according to the manufacturer’s instructions and analysed on a 7500 Real Time 
PCR System. Virus-specific primers were designed using Primer Express 3.0 (Life 
Technologies) and tested by conventional PCR using plasmids containing the com- 
plete DNA-B of each virus (10° copies per reaction). The genomic copies of 
CaLCuV were normalized against an internal control (18S rRNA) to consider tem- 
plate input variation between tubes. CaLCuV DNA was amplified with primers 
B-Fwd (5’-GGGCCTGGGCCTGTTAGT-3’) and B-Rvs (5'-ACGGAAGATGGG 
AGAGGAAGA-3’). In this case, the genomic unit refers to one copy of the DNA-B 
of CaLCuV. PCR reactions were run in parallel with primers 18S-Fwd (5’-TAA 
TTTGCGCGCCTGCTGCC-3’) and 18S-Rvs (5’-TGTGCTGGCGACGCATCA 
TT-3’) for the reference plant gene 18SRNA. Standard curves were obtained by 
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regression analysis of the Ct values of each of the three replicates ofa given dilution 
as a function of the log of the amount of DNA in each dilution. Standard curves 
containing viral DNA and host DNA served as references. Each sample was ana- 
lysed in triplicate from at least two biological replicates. 

Two-hybrid screening. The yeast reporter strain MaV203 (MAToleu2-3,112 trp1- 
901 his3200 ade2-101 gal4 gal80 SPAL10::URA3 GALI::lacZ HIS3UAS GALI:: 
HIS3-LYS2 can1R cyh2R; Trp- Leu- Ura-) was transformed sequentially with pBD- 
L10 and 25 1g of the pEXAD502 cDNA library previously prepared” and ap- 
proximately 5 X 10° transformants were screened for histidine protoprophy an 
B-galactosidase activity, as described”””*. 

LIMYB-based plasmid constructs. The At5g05800 cDNA (LIMYB), which was 
isolated based on its ability to bind to rpL10A in yeast, was amplified by PCR from 
the cDNA library vector using AT5G05800-specific primers, re-amplified with the 
primers AttB1-Fwd and AttB1-Rvs, and cloned by recombination into the entry 
vectors PDONR201 and pDONR207. The resulting products were then transferred 
to different destination vectors for expression in yeast (pDEST32 and pDEST 22) 
and plants (pK7 FWG2, 35S-YFP-casseteA-Nos-pCAMBIA1300, pSPYNEGW and 
pSPYCEGW). The resulting clones, pBD-At5g05800ST (pUFV1903) and pAD- 
At5g05800ST (pUFV 1480), enabled the expression of AT5G05800 (LIMYB) in yeast 
as GAL4 binding domain (BD) or activation domain (AD) fusions, respectively. The 
clone pAt5g05800-GFP (pUFV1395) harboured the AT5G05800 (LIMYB) coding 
region fused to the GFP N terminus under the control of the 35S promoter, whereas 
in pYFP-At5g05800 (pUFV1886), the At5g05800 coding region was fused to the 
YFP C terminus. In the resulting clones, pSPYNE-At5g05800 (pUFV1658) and 
pSPYCE-At5g05800 (pUFV1657), the At5g05800 cDNA was linked to the YFP 
NE and the YFP CE, respectively, under the control of the 35S promoter. 

The HA epitope was fused to the At5g05800 C terminus using the triple Gate- 
way system, which consisted of the vector p5’, in which the 35S (2) promoter had 
been previously cloned, the clone pUFV1378 (At5g05800 cDNA without a trans- 
lational stop codon in pDONR201) and the vector p3’, in which a HA (6X) epitope 
had been previously inserted. The At5g05800 cDNA was transferred by recom- 
bination into the destination vector pPK7M34GW along with 2X35S promoter and 
the HA tail, yielding the clone p235S-At5g05800-6HA (pUFV1984), which enabled 
the expression of At5g05800 fused in frame to HA under the control of the 35S 
promoter in plants. The same triple Gateway system was used to place the LIMYB 
fused in frame with Cherry or GFP under the control of the LIMYB promoter. 
Transient expression in N. benthamiana leaves. To determine the subcellular 
localization of the proteins, N. benthamiana leaves were agro-infiltrated with 
pAt5g05800-GEP, pYFP-At5g05800 and pYFP-NAC81 (ref. 29), while for the 
co-immunoprecipitation assays, they were agro-infiltrated with pL10-GFP and 
p2X35S-At5g05800-6HA. Agrobacterium tumefaciens strain GV3101 was used for 
agroinfiltration. Agrobacterium infiltration was performed in the leaves of 3-week- 
old N. benthamiana, as previously described’. 

Confocal microscopy. Approximately 72 h after agro-infiltration, 1-cm* leaf ex- 
plants were excised, and the GFP and YFP fluorescence patterns were examined in 
epidermal cells with a X40 or X60 oil immersion objective and a Zeiss inverted 
LSM510 META laser scanning microscope equipped with an argon laser and a 
helium laser as excitation sources, as described’. 

Co-immunoprecipitation assays. The co-immunoprecipitation assay was per- 
formed using the }MACSTM Epitope Tag Protein Isolation Kit (MACS/Miltenyi 
Biotec) according to the manufacturer’s instructions. Total protein extracts were 
prepared from N. benthamiana leaves that had been agro-infiltrated with pL10- 
GFP and/or p235S-At5g05800-6HA. At 48h after infiltration, 200 mg of leaves 
were homogenized with 1 ml of lysis buffer (50 mM Tris-HCl pH 8.0, 1% (v/v) 
Nonidet P-40) and incubated for 2 h with anti-GFP magnetic beads (MACS/Miltenyi 
Biotec) at 4 °C under gentle agitation. After extensive washing of the magnetic beads, 
the bound proteins were eluted using 50 ml of elution buffer pre-warmed to 95 °C. 
The immunoprecipitated proteins were separated by 10% (w/v) SDS-PAGE and 
immunoblotted with anti-HA (Miltenyi Biotec, catalogue number 130-091-972) or 
anti-GFP (Miltenyi Biotec, catalogue number 130-091-833) monoclonal antibod- 
ies. The reacting antibodies were detected using Signal West Pico Chemilumines- 
cent Substrate (Thermo Scientific) according to the manufacturer’s instructions. 
BiFC. N. benthamiana leaves were agro-infiltrated as previously described for the 
transient expression experiments using the following combinations of recombin- 
ant plasmids: pSPYNE-At5g05800 + pSPYCE-rpL10A; pSPYCE-At5g05800 + 
pSPYNE-rpL10A; pSPYNE-At5g05800 + pSPYCE empty vector; pSPYNE- 
rpL10A + pSPYCE empty vector; pSPYCE-At5g05800 + pSPYNE empty vector; 
pSPYCE-rpL10A + pSPYNE empty vector. After incubation for 72 h, the leaf sec- 
tors were examined by confocal microscopy. YFP was excited at 514nm using an 
argon laser, and YFP emission was detected using a 560-615 nm filter. The stability 
of the CE and NE YFP regions was monitored by immunoblotting of transfected 
leaf protein extracts with a polyclonal anti-YFP serum. 


LIMYB- or RPL10A-expressing lines and T474D-expressing limyb lines. Arabi- 
dopsis Col-0 was transformed with pAt5g05800-GEP, generating the transgenic 
line LIMYB-1; with pYFP-At5g05800, generating the transgenic lines LIMYB-2 
and LiMYB-3; and with pYFP-NLS-L10, generating the transgenic lines RPL10-1, 
RPL10-2 and RPL10-3. Homozygous seeds of the T-DNA insertion limyb mutants 
(Salk_032054C and Salk_082995C) were obtained from the Arabidopsis Biological 
Resource Center. The limyb-32 mutant was transformed with the gain-of-function 
mutant T474D, generating the transgenic lines limyb/T474-1, limyb/T474D-2 and 
limyb/T474D-3. Primary transformants were selected using the appropriate anti- 
biotic (50 pg ml” ' kanamycin or 30 pig ml ~' hygromycin), and the stable incorpora- 
tion of the transgene in the plant genome was evaluated by PCR with gene-specific 
primers. The expression of the transgenes was monitored by qRT-PCR. 

LIMYB promoter construct and GUS histochemical assay. Approximately 1 kb 
of the 5’ flanking sequences of NIK1, RPL10A and At5g05800 (LIMYB) were am- 
plified from Arabidopsis genomic DNA using Taq Platinum and specific oligonu- 
cleotides and inserted into the cloning vector pCR8/GW/TOPO (Invitrogen). The 
promoter sequences were then transferred by recombination into the destination 
vector pMDC162. The resulting clones, pNIK1-MDC162, pL10A-MDC162 and 
pAt5g05800-MDC162 (pUFV 1892), harboured the respective promoter sequences 
of the three genes fused to the B-glucuronidase (GUS) reporter gene and were used 
to transform Arabidopsis Col-0 plants. GUS expression was assayed histochemically. 
Luciferase reporter gene assay. The At1g29970 (RPL18A) promoter was isolated 
from Arabidopsis DNA by PCR and cloned into pDONR-P4-P1R, generating 
pUFV2155. The clone LUCF-term-pDON221 (pUFV 2132) harbours the firefly 
luciferase cDNA in the entry vector pDON221, whereas the clone 2X 35S-RLUCF- 
pDONR-P2R-P3 (pUFV 2131) contains Renilla luciferase cDNA under the control 
of a 2X 35S promoter. The RPL18A promoter was transferred from pUFV2155 
to the destination vector pK7m34GW (pUFV 1918), along with 2x 35S-RLUCF- 
pDONR-P2R-P3 (pUFV 2131) and LUCF-term-pDON221 (pUFV 2132), by tri- 
ple recombination. The resulting clone, prAtlg29970-Lucif-term-2X35S RLucif 
pH7M34GW (pUVE2231), harboured the firefly luciferase cDNA under the control 
of the rpL18A promoter, as well as the Renilla luciferase cDNA under the control 
of a2X 35S promoter. The same procedure was used to clone the firefly luciferase 
cDNA under the control of the RPL28e, RPS13A and ubiquitin promoters for lucif- 
erase transactivation assays. 

N. benthamiana leaves were agro-infiltrated with A. tumefaciens GV3101 strains 
carrying the following combinations of DNA constructs: Atlg29970-Lucif-term- 
2X35S RLucif pH7M34GW; At1g29970-Lucif-term-2X35S RLucif pH7M34GW 
+ AT5G05800NS-pK7FWG2; Atl g29970-Lucif-term-2X35S RLucif pH7M34GW 
+ AT5G05800NS-pK7FWG2 + rpL10ANS-pK7FWG2; and Atlg29970-Lucif- 
term-2X35S RLucif pH7M34GW + rpL10ANS-pK7FW@G2Z. Forty-eight hours 
after infiltration, 200 mg of leaf tissue was harvested for total protein extraction. 
Luciferase activity was assayed with the Dual-Luciferase Reporter Assay System 
(Promega) according to the manufacturer’s instructions. 

Protoplast preparation from A. thaliana leaves and transient expression assays. 
Protoplasts were prepared from 15-day-old Arabidopsis seedlings as previously 
described”, except that digestion was initiated for 30s under vacuum and then 
prolonged for 6h with agitation at 80 r.p.m. The transient expression assays were 
performed by electroporating (250 V, 250 LF) 10 jig of the expression cassette DNA 
and 30 ig of the sheared salmon sperm DNA into 2 X 10°-5 X 10° protoplasts ina 
final volume of 0.8 ml. The protoplasts were diluted into 8 ml of MS medium sup- 
plemented with 0.2 mg ml’ 2,4-dichlorophenoxyacetic acid and 0.8 M mannitol 
at pH 5.7. After 36h of incubation in the dark, the protoplasts were washed with 
0.8 M mannitol plus 20 mM MES at pH 5.7 and frozen in liquid N> until further use. 
Statistical analyses. All statistical analyses (including gene expression clustering, 
qRT-PCR and protein synthesis data) were performed in R. 

Cluster analysis. Cluster analysis of the RNA-sequencing data was used to classify 
the treatments (mock-inoculated and infected wild type, T474D and NIK1) accord- 
ing to similarities in the profiles of genome-wide expression at 10 and 21 dpi. The 
uncertainty of clustering results caused by sampling variations was verified by the 
probability-based cluster analysis, which was implemented using the pvclust pack- 
age of R software. In this context, the bootstrap (BP) and approximately unbiased 
(AU) probability are used to validate the reported cluster. The BP and AU values 
are the percentage that a given cluster appears in the bootstrap and multiscale boot- 
strap replicates, respectively. 

RNA-sequencing differential gene expression analysis. The edgeR package of 
R/bioconductor software was used to carry out the gene expression analysis. This 
package assumes negative binomial distribution for the read counts, and the used 
normalization is given by the TMM (trimmed mean of M value) method. The sig- 
nificance of differential gene expression was reported in terms of q values (FDR- 
adjusted P values). 

Enrichment analysis. Gene set enrichment analysis (GSEA) was used to uncover 
biological processes associated with sets of differentially expressed genes instead of 
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focusing on individual genes. This analysis was implemented in the GOstats pack- 
age of R/bioconductor software by using the function GSEAGOHyperGParams, 
which uses the relationships among the gene ontology terms as extra information 
in the statistical inference of groups. 

Percentage of protein synthesis analysis. Analysis of protein synthesis was based 
on the one-proportion one-sided (less) test using chi-squared distribution. The tested 
hypothesis was the 100% synthesized labelled protein for each treatment; thus, if a 
given treatment is significant, it indicates that the proportion (or percentage) is 
different from 100%. 

Confidence interval analysis. Nonparametric bootstrap confidence intervals were 
used on our graphics to increase the accuracy of the confidence limits by an over- 
lapping analysis*’. This method was introduced as a nonparametric device for esti- 
mating standard errors and biases. It isan automatic algorithm for producing highly 
accurate confidence limits from a bootstrap distribution implemented with the 
‘boot’ package in R software. 

Phylogenetic analysis. MYB family sequences were retrieved from the Agris data- 
base (http://arabidopsis.med.ohio-state.edu). The alignment was performed by Maft 
aligner software’, and the tree was built by Fasttree** software. 
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Extended Data Figure 1 | Characterization of the Arabidopsis transgenic 
lines. a, Phenotypes of wild-type (Col-0) and nik1 plants transformed with 
NIK1 (NIK1-5 and NIK1-8), T474D (T474D-4 and T474D-6) or the 
double-mutant G4743V/T474A (inactive kinase, GV/TA-10 and GV/TA-4). 
Transgenic plants (R2 generation, n = 15) were grown in soil at 22 °C under 
short day conditions and photographed 2 weeks after planting. b, T474D 
transcript accumulation in transgenic lines (R2 generation). The expression of 
T474D or NIKI in the leaves of independent transgenic lines was monitored 
by quantitative RT-PCR. Mean + 95% confidence intervals (n = 3) based 

on bootstrap resampling replicates of three independent experiments. 

c, Accumulation of T474D-GFP in transgenic lines. Total protein was 
extracted from the leaves of independent transgenic lines (as indicated), 
immunoprecipitated and immunoblotted with an anti-GFP antiserum. 

d, Chlorophyll content of transgenic lines. Total chlorophyll, chlorophyll a and 
chlorophyll b were determined in leaf sectors of the indicated transgenic lines. 
Error bars, 95% confidence intervals (n = 3) based on bootstrap resampling 
replicates of four independent experiments. e, Transcript accumulation of 
LIMYB in T-DNA insertion mutant lines. LIMYB expression was monitored 
by qRT-PCR of RNA prepared from Col-0, limyb-32 (SALk_032054) and 
limyb-82 (SALK_082995) plants. Gene expression was calculated using the 
27 4* method, and actin was used as an endogenous control. Error bars, 95% 
confidence intervals (n = 3) based on bootstrap resampling replicates of three 
independent experiments. f, Schematic representation of the At5g05800 
(LIMYB) locus in the chromosome 5 and RNA sequencing data. The At5g05800 
gene harbours three introns and four exons. Triangles show the positions of the 
T-DNA insertion in the limyb-32 and limyb-82 mutants, and the blue line 
indicates the position of the amplicon from e. The relative abundance of the 
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mapped RNA hits in the At5g05800 locus is shown in red in limyb-32, black in 
Col-0 and green in limyb-82. The accumulation of LIMYB transcripts was much 
lower in limyb-32 and higher in limyb-82 than in Col-0. Sequencing of the 
limyb-32 and limyb-82 transcripts revealed unprocessed intron sequences and 
premature stop codons that would have prevented the translation of a 
functional protein in these mutant lines. Therefore, limyb-32 and limyb-82 were 
confirmed as loss-of-function limyb mutant lines. g, LIMYB transcript 
accumulation in LIMYB-overexpressing lines (R2 generation). LIMYB 
expression in the leaves of independent transgenic lines was monitored by 
quantitative RT-PCR. Error bars, 95% confidence intervals (n = 3) based on 
bootstrap resampling replicates of three independent experiments. h, General 
downregulation of translational machinery-related genes in LIMYB-1 
seedlings. The ‘MA’ plots show the log of the ratio of the expression levels 
against log concentration, and each dot represents a gene. This plot visualizes 
the contrast of LIMYB-1 and Col-0 seedlings. The smear of points on the 

left side indicates those genes that were observed in only one group of replicated 
samples, and the red points denote ribosomal and protein synthesis-related 
genes. CPM, counts per million; FC, fold change; WT, wild type. i, RPL10 
transcript accumulation in RPL10-overexpressing lines (R2 generation). The 
expression of an NLS-containing RPL10 transgene in the leaves of independent 
transgenic lines was monitored by quantitative RT-PCR. Error bars, 95% 
confidence intervals (n = 3) based on bootstrap resampling replicates of three 
independent experiments. j, Nuclear localization of the NLS-containing 
YFP-RPL1O fusion in transgenic lines. Root tips from transgenic seedlings 
expressing the NLS-containing YFP-RPL10 fusion were directly examined 
under a laser confocal microscope. The figure shows representative confocal 
images from five independent biological replicates. 
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Extended Data Figure 2 | Expression of immune-system-related genes in 
T474D-overexpressing lines and in LIMYB-overexpressing lines. a—c, On 
the basis of our global comparison of EST sequences (Fig. 1a) and the role 

of NIK as an antiviral receptor, we asked whether the constitutive activation of 
NIK would elicit a defence response similar to that induced by geminivirus 
infection via the salicylic acid pathway or typical defence responses to virus. No 
significant gene enrichment was detected in the virus-induced gene silencing 
(GO:0009616) and viral defence response (GO:0051607) categories using 

the gene set enrichment analysis (GSEA) method (Supplementary Table 2). For 
the immune system category, gene enrichment was found in both up- and 
downregulated changes using the GSEA method. However, the typical markers 
of salicylic acid signalling, such as PR1 and SNC1, were either non-differentially 


expressed or downregulated, and the expression of T474D did not enhance the 
salicylic acid level in the transgenic lines. Collectively, these results indicate that 
ectopic expression of T474D did not activate typical viral defences, such as 
salicylic acid signalling or gene silencing. a, b, Transcript accumulation of 
selected immune-system-related gene markers by RT-PCR (a) or RNA- 
sequencing in T474D-overexpressing lines (b). qRT-PCR of a representative 
sample confirmed an 80% match with the RNA-sequencing results. 

c, Transcript accumulation of the immune-system-related genes in LIMYB- 
overexpressing lines. The expression of the indicated genes in the leaves of 
independent transgenic lines was monitored by (RT-PCR. a, c, Mean + 95% 
confidence intervals (n = 3) are shown, based on bootstrap resampling 
replicates of three independent experiments. 
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Extended Data Figure 3 | Ectopic expression of T474D-D downregulates 
translational-machinery-related genes and suppresses de novo protein 
synthesis. a, Representation of the translational-machinery-related genes 
differentially expressed in the T474D lines. The ‘MA’ plots show the log of the 
ratio of expression levels versus the log concentration, and each dot represents a 
gene. This plot represents the contrast between the T474D mock-inoculated 
lines and the Col-0 mock-inoculated lines. The red points denote ribosomal 
and protein synthesis-related genes (as shown in Supplementary Table 3). 

b, Downregulation of global translation by ectopic expression of T474D in 
Arabidopsis. The in vivo labelling of leaf proteins with [°°S]Met was performed 
in 20-day-old Col-0 plants and T474D transgenic lines. The total protein 
extracts were fractionated by SDS—polyacrylamide gel electrophoresis (SDS- 
PAGE), and the radioactive bands were quantified by densitometric analysis of 
the images obtained by autoradiography. The labelling percentage was 
normalized to the leaf chlorophyll content, and the protein loading on the gel 
was adjusted to the Coomassie-stained band of the large subunit of rubisco. 
c, Induction of T474D expression by dexamethasone (DEX) in Arabidopsis 
transgenic seedlings. The constitutive expression of T474D was associated with 
stunted growth in the transgenic lines (Extended Data Fig. 1a) and repression of 
global protein synthesis (Fig. 1b). These phenotypes precluded the use of an 
appropriate normalization method for protein loading in comparative gels of 
contrasting genotypes to estimate precisely the T474D-mediated protein 
synthesis inhibition in our assay. To overcome this limitation, we used a 
dexamethasone-inducible promoter to control the T474D expression in the 
transgenic lines. Arabidopsis seedlings independently transformed with a 
T474D-GFP fusion under the control of a dexamethasone-inducible promoter 
(TR1 and TR2) were treated with 30 1M dexamethasone for 8h, and the 
induction of T474D-GFP expression was monitored by semi-quantitative RT- 
PCR. The dexamethasone-induced expression of T474D for 8h led to a higher 
inhibition of de novo protein synthesis in the transgenic lines, as measured 
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by TCA-precipitable radioactivity, which could be normalized to total protein 
(TR1 and TR2; Fig. 1c). d, Inhibition of de novo protein synthesis by protein 
synthesis inhibitors in Col-0, untransformed lines. We also compared the 
T474D-mediated suppression of translation with known global translation 
inhibitors, such as the cytosolic protein synthesis inhibitors cycloheximide and 
puromycin and the chloroplast translation suppressor chloramphenicol. 
Arabidopsis seedlings (10 days old) were treated with 10 1M cycloheximide 
(Cyclo), 10 4M puromycin (Pur), 25 uM chloramphenicol (Chlor) or 30 uM 
dexamethasone (Dex) for the indicated periods of time, and then they were 
pulse labelled with L-[?°S] Met for 60 min. Lysates of treated cells were measured 
by liquid scintillation counting and normalized to total protein. The relative 
[°°S]Met incorporation was normalized to wild-type (WT = 1) control 
without treatment. Means + 95% confidence intervals (n = 3) based on 
bootstrap resampling replicates of three independent experiments are shown. 
e, Inhibition of de novo protein synthesis by inducible expression of T474D. 
Seedlings (10 days old) from the TR1 transgenic line were treated with 
dexamethasone and the protein synthesis inhibitors for the times as indicated 
in the figure, and then they were pulse labelled with L- ig °§]Met for 60 min. 
Lysates were processed as described in d. Means + 95% confidence interval 
(n = 3) based on bootstrap resampling replicates of three independent 
experiments are shown. Cycloheximide was the most effective inhibitor of 
translation in both the wild-type and T474D-expressing lines. T474D 
expression inhibited global translation to the same extent as puromycin and 
caused a further inhibition in the level of chloramphenicol translational 
inhibition in a combined treatment. The increase in translational inhibition 
by combining T474D expression and chloramphenicol treatment may indicate 
that T474D inhibits cytosolic protein synthesis, which is consistent with the 
T474D-mediated downregulation of components of the cytosolic translational 
machinery (Supplementary Table 3). 
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Extended Data Figure 4 | Isolation of PS fractions from Arabidopsis 
seedlings and LIMYB-mediated inhibition of protein synthesis. 

a, Ultraviolet absorbance profiles of the sucrose gradient used for RNA 
fractionation of dexamethasone (Dex)-inducible T474D transgenic lines. 
Sixteen fractions of 310 11 were collected. NPS RNA includes complexes = 80S 
that fractionated in the top half of the gradient (fractions 2-7) and PS 
(polysome) represents complexes fractionated in the bottom of the gradient 
(fractions 9-15). b, Distribution (%) of NPS and PS fractions on polysome 
density gradients. The percentage of NPS or PS was calculated by integrating 
the areas under the corresponding peaks in the Ags4nm profile delimited 

by a gradient baseline absorbance (buffer density). Values are the 

average + standard deviation (s.d.) of three biological replicates. NPS and PS 
fractions from T474D + Dex samples were significantly different from the 
corresponding fractions of the T474D samples by the t-test (greater). NPS, 
*P value = 0.03725; PS, **P value = 0.009137 (t-test; greater). c, Relative RNA 
content in NPS and PS complexes. RNA was precipitated from NPS and PS 
density regions of sucrose gradients (as in a) and quantified. Relative NPS 
RNA and PS RNA contents from T474D and T474D + Dex samples were 
calculated in relation to the total NPS + PS content from T474D. Values for the 
relative NPS and PS RNA content are the average + s.d. of three biological 
replicates and they were significantly different between the samples (P < 0.01, 


LETTER 


t-test). d, Distribution of specific mRNAs in the PS gradient fractions from 
extracts prepared from T474D seedlings treated (or not) with dexamethasone. 
The RNA on each fraction was reverse transcribed and aliquots amplified with 
specific primers for the indicated genes by qPCR. e, Ultraviolet absorbance 
profiles of the sucrose gradient used for the RNA fractionation of Col-0 
seedlings. PSs from 15-day-old Col-0 seedlings were fractionated on a sucrose 
gradient, and the fractions were manually collected. f, Ultraviolet absorbance 
profiles of the sucrose gradient used for RNA fractionation from T474D 
seedlings. PSs from 15-day-old T474D-overexpressing seedlings were 
fractionated on a sucrose gradient, and the fractions were manually collected. 
g, Levels of the small subunit of rubisco (RBCS) mRNA per fraction. 

The levels of mRNA of RBCS were examined by northern blotting. This 
control was used to ensure the quality and distribution of a specific mRNA. 
h, Overexpression of LIMYB suppresses cytosolic translation. In vivo labelling 
of leaf proteins with [°°S]Met was performed in Col-0 and LIMYB-1 transgenic 
seedlings in the presence and absence of chloramphenicol treatment. The 
total protein extracts were fractionated by SDS-PAGE, and the radioactive 
bands were quantified by densitometric analysis of the images obtained by 
autoradiography. The labelling percentage was normalized to the leaf 
chlorophyll content, and protein loading is shown by Coomassie staining of 
the radioactive gel. 
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Extended Data Figure 5 | Ectopic expression of T474D and LIMYB confers 
tolerance to begomovirus infection. a, Delayed onset of infection in the 
T474D-4- and T474D-6-overexpressing lines. Ecotype Col-0 plants, as well 
as the T474D-4- and T474D-6-overexpressing lines, and the NIK1-8 and 
G473V/T474A-overexpressing lines were infected with CaLCuV DNA by the 
biolistic method. The progression of the infection was monitored by PCR 
detection of viral DNA in the systemic leaves of the inoculated plants. The 
values represent the percentages of systemically infected plants at different dpi. 
Error bars, 95% confidence intervals (n = 3) based on bootstrap resampling 
replicates of four independent experiments. b, Upon symptom development, 
the T474D-induced transcriptome diverges from the infected Col-0 
transcriptome. The mock-inoculated T474D-overexpressing lines exhibited a 
constitutively infected wild-type transcriptome at 10 dpi (Fig. 1a). Nevertheless, 
these T474D transgenic lines did not phenocopy the infected wild-type 
plants because they did not develop symptoms of viral infection. In fact, the 
wild-type plants displayed typical symptoms of CaLCuV infection at 21 dpi, 
such as leaf distortion, stunting with epinasty and chlorosis (Fig. le). The 
symptoms in the T474D-expressing lines at 21 dpi, however, were greatly 
attenuated, with no visible leaf distortion or chlorosis. To examine these 
phenotypes, we performed a Ward hierarchical clustering of the gene 
expression data (normalized by the trimmed mean of M-values (TMM) 
normalization method) from the Arabidopsis infection experiments at 21 dpi. 
The TMM normalization method assumes that the majority of genes are not 


*Non significative according to Negative Binomial 
(edgeR) by q-value (FDR) < 0.05 


T474D infected — Col-0 infected (10 days) 
ViralComponent log2FC q-value NCBI Locus ID 
A 0.9139909 1.381925e-01* CLU65529 
B 1.2348528 1.257184e-08 CLU65530 
T474D infected — WT infected (21 days) 
Viral Component log2FC q-value NCBI Locus ID 
A -4.534558 7.640602e-126 CLU65529 
B -3.682993 1.129406e-112 CLU65530 
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differentially expressed, and it adjusts genes with larger read counts and lower 
variance on the logarithmic scale. The dendrogram provides two types of 

P values: AU (black) and BP (grey). The AU P value comes from multiscale 
bootstrap resampling, while the BP value represents normal bootstrap 
resampling. These P values were calculated by multiscale bootstrap resampling 
using the R-cran package pvclust with a cut-off of 0.05. These P values show the 
significance of the proximity of each gene expression experiment profile. 

The cluster analysis at 21 dpi indicated that when symptoms had developed in 
the infected Col-0 leaves, the T474D-induced transcriptome diverged from the 
infected Col-0 transcriptome. The mock T474D transcriptome formed a 
unique clade, while the infected T474D transcriptome was more closely related 
to the mock-inoculated Col-0 transcriptome. c, Reduced viral transcript 
accumulation in T474D-overexpressing lines at 21 dpi. RNA-sequencing 
data of viral gene transcripts in the systemic leaves of infected wild-type and 
T474D-overexpressing plants at 10 dpi and 21 dpi. d, The onset of infection is 
delayed in LIMYB-overexpressing lines. Ecotype Col-0 plants and LIMYB- 
overexpressing and limyb mutant lines were infected with CaLCuV DNA using 
the biolistic method. The progression of the infection was monitored by the 
PCR detection of viral DNA in the systemic leaves of the inoculated plants. 
The values represent the percentages of systemically infected plants at different 
dpi. Error bars, 95% confidence intervals (n = 3) based on bootstrap 
resampling replicates of four independent experiments. 
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Extended Data Figure 6 | LIMYB, which belongs to the MYB domain- 
containing superfamily, interacts with RPL10 in the yeast two-hybrid 
system. a, LIMYB and RPL10 interact in yeast. LIMYB was expressed in yeast 
as a GAL4 activation domain (AD) fusion (pPAD-LIMYB), and RPL10 was 
expressed as a GAL4 binding domain (BD) fusion (pBD-RPL10). The 
interactions between the tested proteins were examined by monitoring His 
prototrophy. b, The interactions were further confirmed by measuring the 
expression activity of the B-galactosidase reporter enzyme for the second 
reporter gene, B-galactosidase. The interaction between pAD-RPL10 and 
pBD-NIK1 was monitored as a positive control. Means + 95% confidence 
intervals (n = 3) based on bootstrap resampling replicates of three technical 
replicates are shown. c, LIMYB harbours two MYB domains. The position of 
these MYB domains is indicated in the schematic representation of the 
LIMYB primary structure. d, Sequence identity of the closest related LIMYB 
(At5g05800) homologues. e, Dendrogram of MYB domain-containing proteins 
from Arabidopsis. The MYB family sequences were retrieved from the Agris 
database (http://arabidopsis.med.ohio-state.edu). The alignment was 
performed by Maft aligner software using full-length sequences, and the tree 
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was built by Fasttree software (the bootstrap values are indicated close to the 
branch divisions). The arrow indicates LIMYB (At5g05800). f, g, The negative 
controls used in the BiFC analysis. f, Confocal fluorescent image of SPYNE + 
LIMYB:SPYCE, LIMYB:SPYNE + SPYCE, SPYNE + RPL10:SPYCE and 
RPL10:SPYNE + SPYCE, as indicated in the figure. We used a BiFC assay to 
determine whether RPL10 and LIMYB interact in the nuclei of plant cells. 
The formation of a RPL10-LIMYB complex occurred in the nuclei of 
transfected cells independent of the orientation of the LIMYB or RPL10 fusions 
(amino terminus or carboxy terminus of YFP; Fig. 2a), and the reconstituted 
fluorescent signal was much higher than that of the background (control panels 
with combinations of the protein fusions with empty vectors). The figure 
displays representative samples from three independent biological repeats. 
Scale bars, 20 um. g, The C-terminal (SPYCE) and N-terminal region (SPYNE) 
of YFP accumulates detectably in co-transfected leaves. Total protein 

extracts from leaves co-transfected with the indicated constructs were 
immunoblotted with anti- YFP serum. The arrow indicates the position of 
RPL10-SPYCE and RPL10-SPYNE fusions, and arrowheads indicate the 
positions of the C-terminal (SYYCE) and N-terminal (SPYNE) regions of YFP. 
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Extended Data Figure 7 | Nuclear localization of LIMYB. a, Colocalization 
of LIMYB with the nuclear marker gene Glycine max (Gm)NAC81. 

N. benthamiana leaves were co-infiltrated with A. tumefaciens carrying a 
35S::LIMYB-GFP construct and a 35S::YFP-GmNAC81 construct. Forty-eight 
hours after infiltration, the subcellular localizations of the fluorescent fusion 
proteins were examined by confocal microscopy. The figure shows 
representative confocal images from two independent experiments. b, Confocal 
fluorescence image of transiently expressed GFP (left) or LIMYB-GFP (right) 
in epidermal cells of tobacco leaves. Scale bars, 10 jm. The figure shows 
representative confocal images from two independent experiments. 

c, Immunoblotting of transiently expressed LIMYB-GFP in epidermal cells of 
tobacco leaves. Total protein was extracted from agro-infiltrated N. 
benthamiana leaves containing the 35S::GFP (left lanes) or 35S::LIMYB-GFP 
(right lanes) constructs and immunoblotted with an anti-GFP monoclonal 
antibody to examine the stability of the fusion protein. The positions of 
molecular mass are shown in kDa. d, Confocal fluorescence image of transiently 
expressed GFP-LIMYB in epidermal cells of tobacco leaves. Scale bars, 10 um. 
The figure shows representative confocal images from four independent 
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experiments. e, Confocal fluorescence image of transiently expressed LIMYB- 
GFP in epidermal cells of tobacco leaves. Scale bars, 10 tm. The figure shows 
representative confocal images from four independent experiments. 

f, g, Confocal fluorescence image of root cells stably transformed with YFP- 
LIMYB or LIMYB-GFP. Root tips from transgenic seedlings expressing 
YFP-LIMYB (f) or LIMYB-GFP (g) were directly examined under a laser 
confocal microscope. Scale bars, 20 jum. The figures show representative 
confocal images from three biological replicas. Neither the fusion of YFP to the 
LIMYBN terminus nor GFP to its C terminus altered the nuclear localization of 
LIMYB in either agro-inoculated N. tabacum leaves or stably transformed 
Arabidopsis roots. h, i, Confocal fluorescent image of LIMYB fused to GFP or 
mCherry under the control of its own promoter. The figures show 
representative confocal images from two independent experiments. The 
fluorescence was also concentrated in the nucleus of transfected cells by 
expression of LIMYB-GFP or LIMYB-mCherry fusions under the control 

of the LIMYB endogenous promoter. Scale bars, 20 um. Collectively, these 
results indicate that LIMYB was localized in the nucleus. 
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Extended Data Figure 8 | LIMYB, RPL10 and NIK1 display overlapping 
expression profiles. a, pLIMYB::GUS, pRPL10::GUS and pNIK1::GUS are 
ubiquitously expressed in seedling tissues. GUS reporter gene expression was 
histochemically monitored in 2-week-old seedling leaves and roots from 
transgenic lines harbouring a B-glucuronidase (GUS) reporter gene expressed 
from the LIMYB, RPL10 and NIK1 promoters. The figure shows representative 
GUS staining images of three seedlings per genotype. All three genes were 
ubiquitously expressed in all seedling tissues. b, Expression analysis of LIMYB 
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in various plant organs. LIMYB expression was monitored by qRT-PCR of 
RNA prepared from leaves, roots, stems or flowers of Col-0 plants. Gene 
expression was calculated using the 2-““ method, and actin was used as an 
endogenous control. Error bars, 95% confidence intervals (n = 3) based on 
bootstrap resampling replicates of three independent experiments. LIMYB 
was also expressed in the leaves, roots, stems and flowers, indicating that 
LIMYB, RPL10 and NIK1 are co-expressed in several organs. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


At2g41020 (AtWWP1) 


mRNA / Actin 
mRNA / Actin 


limyb-32 


Col-0 —limyb-32 


b 
@Col-0 Glimyb-32 Glimyb/LIMYB-1 Blimyb/LIMYB-3 Glimyb/LIMYB-4 


= 
as] 
x 
=< 
Zz 
[ae 

E 


mRNA / Actin 


LIMYB 


At2g41020 (AtWWP1) 


mRNA / Actin 
oO 
fo) 


o °o 2 
Ny BR Oo 
mRNA / Actin 
oo: Oo So = = 
CoN, BR DO OD AN BK OD 


oO 


Col-O LIMYB-1 LIMYB-2 LIMYB-3 Col-0 


Extended Data Figure 9 | Controls for the regulation of RP gene expression 
experiments. a-i, Expression of the indicated genes in the leaves of 
independent transgenic lines was monitored by qRT-PCR. a, LIMYB 
expression in the limyb-32 mutant was examined. b, LIMYB expression in 
limyb-32 mutant restores wild-type expression of the RP genes. Expression of 
the S13a, L18A and L28e genes was monitored in three independently 
transformed limyb-32 knockout plants with the LIM YB gene. c-f, Expression of 
the unrelated gene AtWWPI1 was monitored as a negative control in three 
independently transformed LIMYB-, RPL10- and T474D-overexpressing lines 
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in addition to the limyb-32 mutant. g, The double-mutant inactive kinase, 
G4743V/T474<A, does not downregulate the RP genes. The transcript 
accumulation of the indicated RP genes was quantified by qRT-PCR in two 
independently transformed nik1 knockout lines expressing the G4743V/T474A 
double mutant. h, i, Expression of LIMYB (h) and the transgene T474D (i) was 
monitored in the limyb-32 lines, which were transformed with T474D. 

a-i, Means + 95% confidence intervals (n = 3) based on bootstrap resampling 
replicates of three independent experiments are shown. 
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Extended Data Figure 10 | T474D requires LIMYB function to mediate transfected protoplasts. Gene expression was calculated using the 27-4“ 
the downregulation of translational machinery-related genes. a-f, A method, and actin and ubiquitin were used as an endogenous control. 
35S::T474D-GFP construct was electroporated into protoplasts prepared from | Means + 95% confidence intervals (n = 3) based on bootstrap resampling 
Col-0 and limyb-32 seedlings, and the expression of the indicated RP genes replicates of three independent experiments are shown. 


was monitored by qRT-PCR of RNA prepared from untransfected and 
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A molecular mechanism of artemisinin resistance in 
Plasmodium falciparum malaria 


Alassane Mbengue’?*, Souvik Bhattacharjee’?*, Trupti Pandharkar'’, Haining Liu'*, Guillermina Estiu'*t, 
Robert V. Stahelin'**, Shahir S. Rizk’?, Dieudonne L. Njimoh’?>, Yana Ryan'”, Kesinee Chotivanich®, Chea Nguon’, 
Mehdi Ghorbal®, Jose-Juan Lopez-Rubio®, Michael Pfrender”, Scott Emrich’, Narla Mohandas’°, Arjen M. Dondorp°", 


Olaf Wiest!?!* & Kasturi Haldar? 


Artemisinins are the cornerstone of anti-malarial drugs’. 
Emergence and spread of resistance to them? * raises risk of wiping 
out recent gains achieved in reducing worldwide malaria burden 
and threatens future malaria control and elimination on a global 
level. Genome-wide association studies (GWAS) have revealed 
parasite genetic loci associated with artemisinin resistance*’°. 
However, there is no consensus on biochemical targets of artemi- 
sinin. Whether and how these targets interact with genes identified 
by GWAS, remains unknown. Here we provide biochemical and 
cellular evidence that artemisinins are potent inhibitors of 
Plasmodium falciparum phosphatidylinositol-3-kinase (P£PI3K), 
revealing an unexpected mechanism of action. In resistant clinical 
strains, increased Pf{PI3K was associated with the C580Y mutation 
in P. falciparum Kelch13 (PfKelch13), a primary marker of arte- 
misinin resistance. Polyubiquitination of PfPI3K and its binding 
to PfKelch13 were reduced by the PfKelch13 mutation, which lim- 
ited proteolysis of Pf{PI3K and thus increased levels of the kinase, as 
well as its lipid product phosphatidylinositol-3-phosphate (PI3P). 
We find PI3P levels to be predictive of artemisinin resistance in 
both clinical and engineered laboratory parasites as well as across 
non-isogenic strains. Elevated PI3P induced artemisinin resistance 
in absence of PfKelch13 mutations, but remained responsive to 
regulation by PfKelch13. Evidence is presented for PI3P-depend- 
ent signalling in which transgenic expression of an additional 
kinase confers resistance. Together these data present PI3P as 
the key mediator of artemisinin resistance and the sole PfPI3K as 
an important target for malaria elimination. 

Our previous work identified an important role for PI3P in protein 
export from the P. falciparum endoplasmic reticulum (ER) to the 
erythrocyte, at the early ‘ring’ stage of blood infection’’. 
Consequently, a secretory reporter that binds PI3P remains in the ring 
ER but in absence of PI3P undergoes default secretion to the parasi- 
tophorous vacuole (PV). This yielded a cell-based screen for drugs that 
inhibit PI3P production (Fig. 1a). We were particularly interested in 
artemisinins because clinical resistance to them develops at the early 
ring stage’. Low nanomolar concentrations of dihydroartemisinin 
(DHA), the active form of all artemisinins block production of PI3P 
(Fig. 1a). This effect is fast acting (within 30 min), reversed by washing 
out the drug and without effect on subsequent parasite growth 
(Extended Data Fig. la). Wortmannin or LY294002, active against 
the sole parasite PfPI3K'*’*, but not the inactive LY303511 blocked 


PI3P production. Artemisinin and artesunate were also inhibitory 
(Extended Data Fig. 1b, c), but deoxyartemisinin, anti-folates and 
aminoquinolines had no effect (Fig. 1a and Extended Data Fig. 1b-e). 
Biochemical analyses confirmed that DHA reduced mass PI3P levels 
and drug washout restored PI3P levels (Fig. 1b). Quantitative inhibi- 
tion of immunopurified PfPI3K was achieved by 4nM DHA but not 
by deoxyartemisinin (Fig. 1c). DHA at 10 uM failed to significantly 
inhibit 46 mammalian kinases, including its closest human ortho- 
logue VPS34 (a class III kinase; Fig. 1d and Extended Data Table 1) 
strongly supporting the conclusion that DHA is not a promiscuous 
kinase inhibitor. 

In the absence of a crystal structure of PfPI3K, a computational 
model can provide structural hypotheses to help understand experi- 
mental results. We used homology modelling of PfPI3K and 20 ns 
molecular dynamics (MD) simulations'* to study the binding of 
DHA and several structural analogues (Fig. le-g, Extended Data 
Figs 2 and 3a and Supplementary Data 1). The model suggests polar 
contacts of DHA with the D1889 hydroxyl and the Y1915 lactol oxy- 
gen of PfPI3K at the binding site (Fig. 1f) as well as an excellent shape 
complementarity of DHA with PfPI3K at its hydrophobic region 
(Fig. 1g). This is in agreement with the rapid inhibition of PfPI3K 
by DHA at nanomolar concentrations. MD simulation also rationalize 
the experimentally observed lack of inhibition of VPS34, mammalian 
class I or class II PI3 kinases by DHA (Extended Data Fig. 3b-d). 
Artesunate (at high 101M concentrations) has been reported to 
inhibit activities of the PI3K-AKT pathway in mammalian sys- 
tems!*'”, However, whether this was due to direct inhibition of 
PI3K or AKT, the DHA metabolite or other indirect effects (such as 
high concentrations that far exceed inhibitory concentrations for mal- 
aria parasites) was not studied. 

Together, the data in Fig. 1 suggest that DHA specifically targets 
PfPI3K. Yet GWAS studies suggest that >1,000 genes (including 
PfPI3K) show clinical resistance to artemisinins®; moreover PfPI3K 
polymorphisms are not detected in all resistant strains. Rather, there is 
selective pressure in regions of chromosome 13 (refs 5, 9) and in 
particular pfkelch13 (refs 7, 8, 10) but the mechanism is unknown. 
The mammalian orthologue of PfKelch13 functions as a substrate 
adaptor for an E3 ubiquitin ligase’*. The putative substrate binding 
domain of PfKelch13 shows characteristic ‘Kelch’ propeller domain”"* 
(Fig. 2a) mutations in which associate with artemisinin resistance”*””. 
We hypothesized that mutations may decrease affinity for a protein 
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substrate, thereby increasing its steady state levels by reducing ubiqui- 
tination and (proteosomal) degradation. 

We tested whether changes in PfPI3K levels were associated with 
PfKelch13 mutations in clinical and engineered laboratory strains. 
Analysis of clinical isolates from Cambodia’? (also see Methods) sug- 
gested a ~1.5-fold to twofold increase in PfPI3K levels in resistant 
versus sensitive strains (Fig. 2b). One sensitive clinical strain (ANL-7) 
contained the resistance mutation R539T: genome sequencing 
revealed it was not clonal suggesting contamination with a resistant 
parasite strain (data not shown). The PfKelch13 mutation C580Y was 
not present in clinical sensitive strains but detected in two of three 
resistant strains. CS80Y was also recognized to be the most prevalent 
mutation in resistant populations’® and thus investigated further in 
two distinct laboratory strains. We used parasites with chromosomally 
inserted PfKelch13@°™ in the P. falciparum NF54 (ref. 20) (Extended 
Data Fig. 4a). Additionally, we expressed a HA-tagged form of 
PfKelch13©°°Y in a second strain 3D7 (Extended Data Fig. 4b and 
Extended Data Table 2). Both mutated strains showed twofold to 
threefold increase in levels of PfPI3K relative to their PfKelch13" 
counterparts (Fig. 2c, d) without changes in levels of PfKelch13 
(Extended Data Fig. 4a, c). 

Further analysis revealed that immunopurified (IP) PfPI3K was 
bound to PfKelch13" and this was reduced by the C580Y mutation 
(Fig. 2e). K48-linked and atypical ubiquitination, both signals for pro- 
teosomal degradation were prominent in PfPI3K and concomitantly 
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Figure 1 | Dihydroartemisinin targets PfPI3K. 
a, SS-EEA1“"-mCherry detects ring PI3P in 
punctate (ER) domains''. Mutant SS-EEA1™3744. 
mCherry secretes to the PV'' (second row). 
Treatment with 4nM DHA redistributes SS- 
EEA1™'-mCherry to the PV. Washout restores ER 
PI3P. Treatment with 4nM deoxyartemisinin had 
no effect. Blue, nucleus; scale bar, 5 jim; P, parasite; 
E, Erythrocyte. Mean (s.d.) with three experimental 
replicates with image analysis from 400 optical 
sections. b-d, Effects of DHA on PI3P mass 

(b); immunopurified PfPI3K (raw data in 
Supplementary Data 2) (c); and mammalian 

PI3 kinases (d). Mean from three experimental 
replicates (each with triplicate data points). For 

b, s.d. <3; ¢, upper graph, s.d. <1.5; lower graph 
s.d. <5; d, s.d. <0.5. e, Overlay of the model of 
PfPI3K (cyan) and human class III PI3KVPS34 
(grey, PDB code 3IHY) with active site marked 
(asterisk). f, DHA in PfPI3K model (cyan) binding 
site. g, Surface representation of f. Additional 
details in Extended Data Figs 1-3. 


reduced by PfKelch13°*°Y. K63-linked ubiquitination”? was not 
detected suggesting ubiquitination drives degradation rather than a 
change in cellular localization and endocytic trafficking and PfPI3K 
continued to be cytoplasmically distributed as seen in absence of muta- 
tion (Extended Data Fig. 1f). Fragmentation detected in PfPI3K was 
diminished in presence of the C580Y mutation. Cleavage of the kinase 
began even before it was fully released from PfKelch13" and con- 
firmed that lack of effect of PfKelch13©%" was due to its failure to 
bind PfPI3K (Fig. 2f). Notably, Kelch adapters for E3 ligases are sub- 
strate specific and therefore our data well support the model in Fig. 2g. 
As PfPI3K is an early ring-stage target of artemisinins, elevation of the 
kinase and/or its products may provide a mechanism of artemisinin 
resistance at this stage. In an isogenic background, PfKelch13 muta- 
tion linked to resistance may increase levels of PfPI3K (Fig. 2c). None- 
theless equal amounts of PfPI3K across two distinct non-isogenic 
strains may not show equivalent activity for a variety of reasons, 
including different levels of precursor lipid substrate pools. We 
therefore examined whether PfPI3K activity, as measured by PI3P 
production provided a quantitative estimation of resistance across 
non-isogenic strains. PI3P can only be produced by (the sole) 
P£PI3K”’. PI3,4P2 and PI3,4,5P3 (derived from PI3P) are both detected 
as minor fractions of the total cellular PI3P and under 1% in rings’ 
the stage of clinical artemisinin resistance. Thus ring parasite PI3P 
levels directly reflect the activity of PfPI3K. Figure 3a indicated there 
was linear correlation between the levels of PI3P and resistance 
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Figure 2 | Proteostatic regulation of PfPI3K by 
PfKelch13. a, PfKelch13 ‘Kelch’ propeller domain. 
b, PfPI3K, PfKelch13 polymorphisms, PfPI3K 
levels in clinical (ANL) resistant (2, 4, and 9) and 
sensitive (1, 7) strains and laboratory strain 3D7; 
loading control PfFKBP. ANL-7, contaminated 
with resistance mutation R539T. c, d, Western 
blots show PfPI3K in NF54-PfKelch13©*" and 
NF54-PfKelch13“"; 3D7-PfKelch13*°"-HA 
and 3D7- PfKelch137-HA; loading controls, BiP, 
PfFKBP. e, Top left, PfKelch13 (arrow) binding to 
PfPI3K is reduced by PfKelch13~°*°”. In the 
PfKelch13™" background, IP PfPI3K displays 
K48-ubiquitination (top middle, arrow), atypical 
poly-ubiquitination (bottom left, asterisk) and 
degradation (bottom right), all reduced by 
PfKelch13°*Y, K63-ubiquitination was not 
observed (top right). IP lysate input PfFKBP. 

f, PfKelch13 but not PfKelch13@*” binds PfPI3K 
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(as measured by ring-stage survival assays (RSA)”’; Extended Data Fig. 
5a) in both clinical and genetically engineered laboratory strains. 

To elevate PI3P in the absence of PfKelch13, we transgenically 
expressed human VPS34 (Fig. 3b and Extended Data Fig. 5b-d). 
Human VPS34 synthesizes only PI3P and in the absence of 
PfKelch13 mutation increased ring PI3P and resistance in two inde- 
pendent 3D7 transgenic lines (Pf’?°**™" and PfY??*™); but a 
catalytically inactive VPS34 or a control reporter had no effect 
(Fig. 3c, d and Extended Data Fig. 5b-e). Introduction of 
PfKelch13" into VPS34 transgenic parasites reduced PI3P and 
RSA while the PfKelch13°*Y mutation reciprocally increased cellular 
PI3P and RSA levels (Fig. 3c, d and Extended Data Fig. 5b-f). The best 
fit line in Fig. 3d is closely comparable to that in Fig. 3a, suggesting the 
VPS34 transgenic lines remained responsive to PfKelch13. These data 
indicate that elevated PI3P levels confer artemisinin resistance 
(Fig. 3e). Notably, a ~2.5-fold increase in PI3P induced greater than 
one order of magnitude change in resistance suggesting that as PI3P is 
a signalling lipid, small changes in its levels may induce downstream 
activation of large amplitudes of resistance (Fig. 3e). 
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weightsin kDa. In b-f, data are representative of 
three experimental replicates. g, A model for Fig. 2. 
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data for b-f is in Supplementary Data 2. 
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We next queried whether additional cellular components could also 
influence PI3P and RSA levels. PISK-AKT is a primary signal trans- 
duction pathway in eukaryotes but its role is poorly understood in 
malaria parasites. In addition to PI3P, PfPI3K can also synthesize 
PI3,4,5P3 (ref. 12), which in most cells is needed for activation of 
AKT”. P. falciparum has an orthologue of AKT (PfAKT/ 
PF3D7_1246900; Extended Data Fig. 6a). However Pf{AKT appears 
different from its mammalian counterparts because it lacks a PH 
domain and a conserved Ser473. Rather unexpectedly, we found that 
DHA blocks cellular PEAKT activity (Fig. 4a) but did not inhibit puri- 
fied PfAKT (Fig. 4b). As it targets PfPI3K (Fig. 1), we reasoned that in 
parasites DHA may block PfAKT through inhibition of PfPI3K. 
Although PfAKT lacks a PI3,4,5P3-binding PH domain, it may function 
through a calmodulin-binding PH domain protein™ as PfAKT contains 
a calcium/calmodulin activator domain. As indicated earlier, low levels 
of PI3,4,5P3 are made by PfPI3K” (and in this regard, PfPI3K is also 
different from its closest homologue VPS34 which produces solely PI3P). 
Transgenic elevation of P[AKT (Extended Data Fig. 6b) induced a ~1.8- 
fold elevation of PI3P (presumably stimulated by feedback mechanisms) 
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Figure 4 | PI3P-mediated resistance linked to GWAS. a, b, Effect of DHA on 
cellular (a) and immunopurified (b) PfAKT. c, Expression of PEAKT-GFP in Pf 
3D7. Loading controls, PfFKBP, erythrocyte band 3. Molecular weights in kDa. 
RSA, PI3P, as indicated. In a-c, mean of three experimental replicates (each 

with triplicate data points). AKT activity and RSA s.d. <0.5; PI3P s.d. <5. In 
b and c, western blots are representative of three experimental replicates; raw 
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data in Supplementary Data 2. d, PI3P-mediated resistance pathways. Bold, 
lipid (PI3P) or protein components from this study and GWAS; non-bold 
indicates GWAS only®”’. Solid lines, established relationships; broken/dotted 
lines, partial functional validation; broad grey arrows, in silico prediction. 
Additional details in Extended Data Fig. 6. 
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and RSA of 6.5, which is comparable to the resistance level seen in one of 
the clinically derived lines (ANL-4; Fig. 4c). 

Evidence that PfAKT is a major downstream effector of PfPI3K is still 
lacking. Nonetheless, the data suggest that secondary genes may influ- 
ence levels of PfPI3K/PI3P product perhaps explaining why AKT (and 
other kinases) are also under artemisinin-selective pressure’. Differences 
between the pools of available substrates (phosphatidylinositol; PI, ATP) 
extent of ubiquitination (and additional unknown regulation) of PfPI3K 
may also be influenced by complex lipid, biosynthetic, transport and 
signalling networks (Fig. 4d) that vary across genetic backgrounds. 
Therefore absolute levels of PfPI3K across non-isogenic strains may 
not provide an accurate measure of PfPI3K activity/PI3P product and 
artemisinin resistance. Genes involved in lipid uptake” are also impli- 
cated in resistance and serum lipids are known to be essential for 
intracellular blood stage parasite growth’®. This may explain the wide 
range of RSA values displayed by resistant, clinical strains and why two 
resistant strains like ANL-2 and ANL-4 bear the same mutation C580Y 
and equivalent levels of PfPI3K, but different amounts of PI3P product 
and thus different levels of resistance. 

DHA has many other targets in the later trophozoite parasite stage””- 
**, but PfPI3K is the only known target in early ring stages: clinical 
resistance to artemisinins develops in early rings (and in absence of 
haemoglobin digestion associated with either late rings or trophozoites). 
The two most prevalent mutations (C580Y and R539T) associate with 
elevation of PI3P. All resistance mutations of PfKelch13 are limited 
to the Kelch propeller domain, suggesting they too regulate PfPI3K 
activity to contribute to differences in RSA. The molecular identities 
of other putative PfKelch13 targets and the ubiquitin ligase activity of 
the complex have yet to be investigated. PI3P may influence host remod- 
elling, and functions of the apicoplast and food vacuole’ as well as cell 
survival through redox, transcriptional and DNA repair pathways (grey 
arrows, Fig. 4d), all of which have also been implicated in artemisinin 
resistance’*”°°. Sustained, selective targeting of the PfPI3K and/or its 
regulation will be critical to developing new therapies that allow elim- 
inating artemisinin resistance and rendering effective malaria control. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Cloning and generation of constructs for P. falciparum transfection. All con- 
structs used for generating transgenic P. falciparum 3D7 parasites were assembled 
in different plasmids and finally cloned into pA150 or pA156"'. This resulted in the 
expression of the proteins under the calmodulin (cam) promoter. The generation 
of constructs pA156 (SS-EEA1“?-mCherry) and pA156 (SS-EEA1“7-mCherry) 
have been described earlier'’. 

For the generation of constructs pA150 (PfKelch13“’-HA), pfkelch13 
(PF3D7_1343700) was amplified from P. falciparum genomic DNA using 
PfKelch13AvrlIF and PfKelch13HAXholR; digested with AvrII/Xhol and cloned 
into corresponding sites of pA150. The construct pA150 (PfKelch13@°*°Y-HA) 
was generated using overlapping PCR strategy. Briefly, the products of PCR1 
(using PfKelch13AvrlIF and C580Yreverse) and PCR2 (using C580Yforward 
and PfKelch13HAXhoIR) reactions were used as template for the overlapping 
PCR3 to generate full length pfkelch13 (with residues conferring C580Y mutation 
in the protein sequence), which was subsequently digested with AvrII-Xhol and 
cloned into pA150 to generate pA150 (PfKelch13@°*°Y-HA). 

The plasmid containing human VPS34 (Cat# SC118487) was purchased from 
Origene Inc. (Rockville, MD, USA). This was used as template to PCR amplify 
human VPS34 using HsVPS34AvrIIF and HsVPS34mycXholR that was subse- 
quently digested and cloned to the AvrII/Xhol site of either pA156 (with bsd 
resistance cassette) or pA150 (with hdhfr resistance cassette) to generate pA156 
(VPS34-mycl) and pA150 (VPS34-myc2), respectively. The construct pA150 
(VPS34MUTANT-™Y¢1) was generated using overlapping PCR strategy. Briefly, the 
products of PCR1 (using HsVPS34AvrIIF and VPS34-742AAA745R) and PCR2 
(using VPS34-742AAA745F and HsVps34mycXholIR) reactions were used as 
template for the overlapping PCR3 to generate full length VPS34’"™"0"""" (with 
residues ’**DRH”** changed to AAA in the protein sequence), which was subse- 
quently digested with AvrII/XholI and cloned into pA150 to generate pA150 
(VPS34MUTANT-mycly, 

The construct pA150 (PfAKT-GFP) was generated as follows. Briefly, the full- 
length pfakt was amplified from P. falciparum genomic DNA using AKTAvrII_F 
and AKTBglII_R primers and cloned into AvrII/BglII site of pA150 (SSGFP)". 
This resulted in an in-frame fusion with gfp at the 5’ end. 

A single amino acid mutation replacing cysteine at position 580 with tyrosine 

(C580Y) in PfKelch13 (PF3D7_1343700) was generated using CRISPR-Cas9 
technology as described elsewhere”. 
Sequence analysis of pfkelch13 and pfpi3k polymorphisms in the clinical 
strains. Clinical strains were first propagated in tissue culture and genomic 
DNA isolated using Quick-gDNA Blood MiniPrep kit from ZYMO Research 
following manufacturer’s instructions. 

For the sequence analysis for polymorphism in pfkelch13, either the full length 
pfkelch13, the full-length gene was amplified using Kelch13_F and Kelch13_R 
primers, or specific regions were amplified using Kelch13-1, Kelch13-2, 
Kelch13-3, Kelch13-4, Kelch13-5 and Kelch13-6 primers and analysed. 

To check for pfpi3k polymorphism in clinical strains, specific regions of pfpi3k 

were amplified using PfPI3K-I682T-F/ PfPI3K-I682T-R, PfPI3K-Q431K-F/ 
Pf£PI3K-Q431K-R, PfPI3K-Y1330C-F/ PfPI3K-Y1330C-F primer pairs and 
sequenced. 
Generation of anti-peptide antibodies. All anti-peptide antibodies were custom- 
generated by Thermo Scientific (Rockford, IL, USA). Antibodies against P. falci- 
parum PI3K (Pf£PI3K, PF3D7_0515300) were generated in guinea pigs and rabbits 
against the C-terminal peptides CVDKLHEWALNWK and CVLKVQEKFRLD- 
LNDE, respectively. Antibodies were also custom-generated in rat, against the 
peptide RKRFDEERLRFLQEIDKI and corresponding to the amino acids 299- 
316 of PfKelch13 (PF3D7_1343700). Antibodies to BiP (PF3D7_0917900) were 
generated against the peptide DYFIKMFKKKNNIDLRTDKR corresponding to 
amino acids 261-280. For the generation of rabbit antibodies to PFAKT (RAC-beta 
serine/threonine protein kinase, PF3D7_1246900), amino acids corresponding 
to 229-246 (KKDKKIINLKKYNNAHRD) were selected. Antibodies were also 
custom-generated in rats, against the peptide REIVGDNTIEKKTEKALRE 
corresponding to amino acids 90-108 of PfExp-2 (PF3D7_1471100). 

All guinea pig antibody generations involved collection of control sera on day 0 
followed by immunization on day 1 with 0.1 mg of keyhole limpet haemocyanin 
(KLH)-conjugated peptide with complete Freund’s adjuvant (CFA) subcuta- 
neously at 4 sites. Booster immunizations were performed on days 21 and 42 with 
50 ug of KLH-peptide with incomplete Freund’s adjuvant subcutaneously. Test 
bleeds were checked by western blotting with uninfected and infected erythrocytes 
and corresponding animals were finally boosted again on day 62 followed by 
exsanguination bleed and peptide-affinity purification of the antibodies. 

For antibody generation in rabbits, control sera was collected on day 0 and 
animals were immunized on day 1 with 500 jg of KLH-conjugated peptides sub- 
cutaneously at 10 sites. Animals were again boosted on days 14 and 28 with 250 pg 


antigen and sera collected on day 35 to check the reactivity. Finally, after a booster 
immunization of selected animals on day 56/58, production bleed was collected on 
day 72 and processed for peptide-affinity purification of the antibodies. 

The reactivity of all custom-generated antibodies were confirmed by western 
blots using uninfected and infected red cells and used either for immunofluores- 
cence, immunoprecipitation assays or western blot analyses. 

Parasite culture, transfection and live cell imaging. P. falciparum 3D7 parasites 
were propagated in A+ human erythrocytes (bought from Biochemed Services, 
Winchester VA, USA: protocol approved by the University of Notre Dame) in 
RPMI supplemented with 0.5% Albumax II, 0.2mM hypoxanthine, 11mM 
Glucose, 0.17% NaHCO;, 10 pg/ml gentamycin and maintained at 37°C in 5% 
C0, in a humidified incubator. Synchronous P. falciparum 3D7 parasites in culture 
were transfected with indicated plasmids by standard procedures as described 
earlier'’. Forty-eight hours after transfection, the cultures were selected either with 
2.5 nM WR99210 (Jacobus Pharmaceuticals; for transfections involving pA150) or 
1 pg ml blasticidin hydrochloride until stable cells lines were obtained. Culture- 
adapted, clinical field strains of P. falciparum as well as NF54 parasites expressing 
PfKelch13" and PfKelch13*™ (ref. 20) were propagated in the same media at 
37°C in 90% No, 5% 02 and 5% C0, in a humidified incubator. Cultures were 
monitored daily by Giemsa staining of methanol-fixed smears and fed as neces- 
sary. Parasitemia were usually maintained below 12% for healthy culture growth. 
Patient sample collection and short term cultures. After hospital admission and 
informed consent, 10 ml of whole P. falciparum- infected blood was drawn from 
adults at multiple field sites in western Cambodia. White blood cells were removed 
over a CF11 column. The infected red blood cells were subjected to short term in 
vitro culture in RPMI1640 containing 20% human serum or Albumax II (0.5%) for 
a maximum of eight weeks. This study was approved by the Oxford Tropical- 
medicine Research Ethical Committee (OXTREC) as well as the Ministry of 
Health in Cambodia; the trial was registered under NCT00493363. The study 
was also approved by the University of Notre Dame. 

Treatments with PI3K inhibitors, wortmannin, LY294002, the inactive ortho- 
logue LY303511 and artemisinins. Highly synchronized parasites at schizont 
stages from P. falciparum 3D7 (transgenically expressing secretory SS-EEAIWT- 
mCherry) were purified using percoll density gradients. Purified schizonts were 
then allowed to invade fresh batch of red blood cells for 6h. The resulting intra- 
cellular rings were exposed to mock treatment (0.1% DMSO) or the following 
compounds: wortmannin, LY294002, inactive LY294002 analogue LY303511 
(from Selleckchem), artemisinin, artesunate or dihydroartemisinin (DHA) (from 
Sigma-Aldrich) at the indicated concentrations in DMSO (0.1%). Drug treatments 
were carried out as indicated for 30 min or 4h. Cells were subsequently processed 
for live cell imaging, immunofluorescence assay (IFA) and western blotting. Drugs 
were removed by washing thrice with serum-free RPMI, and infected erythrocytes 
were imaged after 1 h. In addition, where indicated parasite growth in culture was 
monitored 24h later in Giemsa smears. 

Imaging of live and cells by fluorescence microscopy and quantitative ana- 
lyses. For live cell imaging, parasites were processed as described previously*’. In 
both live and fixed cells, the parasite nuclei were stained with Hoechst 33342 and 
cells were imaged with a X100, NA 1.4 objective on an Olympus IX inverted 
fluorescence microscope with a temperature controlled stage and a Photometrix 
cooled CCD camera (CH350/LCCD) driven by DeltaVision software from 
Applied Precision (Seattle, WA). 

Quantitative analysis for the fraction of perinuclear SS-EEA1“"-mCherry 
fluorescence was undertaken using DeltaVision software. Briefly, consecutive 
0.2 micron optical z-sections per cell were observed for either perinuclear/ER-like 
fluorescence pattern or peripheral/PV-like pattern under control or drug-treated 
condition and represented graphically. 400 optical sections were quantified per 
treatment (10 in-focus optical sections of the parasite per infected cell). While 
perinuclear/ER fluorescence was an indication of no observable effect on the PI3P 
level, peripheral/PV fluorescence was an indication of a reduction in PI3P level on 
drug treatment. 

Immunoprecipitation of PfPI3K and PI3-kinase assay. Pf{PI3K (PF3D7_0515300) 
was immunoprecipitated from parasite stocks previously frozen at —80°C. 
Proteins were extracted 1h at 4°C using extraction/lysis buffer containing 
10mM Tris. HCl, pH7.5, 100mM NaCl, 5mM EDTA, 1% Triton X-100, 
100 1M sodium orthovanadate, 20 1M sodium fluoride, 20 1M B-glyceropho- 
sphate, and 1X protease inhibitor cocktail, Roche Diagnostics; 10% glycerol). 
After removal of insoluble debris, the extract was incubated with guinea pig anti- 
PfPI3K for 12 h at 4 °C. Protein A agarose beads (pre-equilibrated in lysis buffer) 
was then added and additionally incubated for 6 h. Immune complexes bound to 
beads were extensively washed with extraction buffer. PfPI3K-bound beads were 
either processed for western blotting or kinase activity assay. For western blotting 
beads were directly boiled in reducing SDS-PAGE sample buffer, resolved 
by SDS-PAGE and probed with rabbit anti-PfPI3K. Kinase assay of PfPI3K in 
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immunoprecipitated beads was measured using Class III P13-Kinase kit from 
Echelon Biosciences (K-3000) following the manufacturer’s instructions. Mean 
(s.d.) from three experimental replicates, each containing duplicate data points 
are shown. 

Mammalian kinase assays. The kinase activity of mammalian kinases was per- 
formed by Life Technologies (Grand Island, NY, USA) in the absence or presence 
of 10 1M DHA, through SelectScreen Biochemical Kinase Profiling Service, and 
expressed as percentage inhibition. 

Immunolocalization by indirect immunofluorescence assays (IFA). Indirect 
immunofluorescence assays (IFAs) were performed on non-transfected P. falci- 
parum 3D7 or transgenic parasites fixed with glutaraldehyde/paraformaldehyde as 
described previously” using the following antibody concentrations- rabbit anti- 
PfPI3K (custom-made, 20 'g ml’), guinea pig anti-Exp-2 (custom-made, 20 1g 
ml~') and mouse anti-myc antibodies from Abcam (Cambridge, MA, USA). The 
appropriate FITC- or TRITC labelled secondary IgG antibodies (ICN 
Biochemicals) were used at 1:200 dilution. Parasite nuclei were stained with 5 mg 
ml ! Hoechst 33342 (Molecular Probes) and slides were mounted with DABCO. 
Immunoprecipitation and western blotting. PfPI3K was immunopurified from 
P. falciparum 3D7 transgenic parasites (3D7-PfKelch13”'-HA and 3D7- 
PfKelch13“*°-HA) using guinea pig anti-PfPI3K as described above. 
Immunoprecipitations using Protein A agarose beads were used as negative con- 
trol. Western blots were subsequently performed using custom-generated rabbit 
anti-PfPI3K, rat anti-PfKelch13 antibodies, commercial rabbit anti-polyubiquitin 
(MBL-PW8810), K48-linkage specific polyubiquitin (Cell Signaling Technology- 
4289S), K63-linkage specific polyubiquitin (Cell Signaling Technology-5621S) or 
mouse monoclonal anti-HA tag antibody from (Abcam-ab18181) antibodies. 

To measure the relative amount of PfPI3K expression in P. falciparum 3D7 and 
clinical isolates, 4 X 10° infected erythrocytes permeabilized with 0.01% saponin 
followed by extensive washing with PBS to remove haemoglobin. Cells were the 
solubilized in Laemmli’s sample buffer, resolved by SDS-PAGE and western 
blotting performed using guinea-pig anti-PfPI3K, followed by HRP-conjugated 
donkey-anti guinea pig antibodies from Jackson ImmunoResearch (West Grove, 
PA, USA). Blots were developed using chemiluminescence assay kit from Thermo 
Scientific (Rockford, IL, USA). Antibodies to the parasite protein PfFKBP served 
as a loading control. The intensities of PfPI3K and PfFKBP signals were quanti- 
tated by densitometry and expressed as a ratio. 

The relative amount of PfPI3K expression in the transgenic parasites NF54- 
PfKelch13“", NF54-PfKelch13*°", 3D7-PfKelch13“"-HA and  3D7- 
PfKelch13“°*°Y-HA was measured by resolving the saponin-permeabilized pellet 
from 2 X 10’ parasites by SDS-PAGE and western blotting using guinea pig anti- 
PfPI3K followed by HRP-conjugated donkey-anti guinea pig antibodies and 
chemiluminescence detection. Antibodies to parasite proteins BiP and PfFKBP 
were used as loading controls. Antibodies to BiP were custom generated, while 
antibodies to PfFKBP were kindly provided by Nirbhay Kumar. The intensities of 
PfPI3K and PfFKBP signals were quantitated by densitometry and expressed as a 
ratio. 

Transgenic P. falciparum 3D7 parasites, either expressing myc-tagged VPS34 
(VPS34-mycl and VPS34-myc2), those expressing PfKelch13“7-HA/ 
PfKelch13*°Y-HA, as well as those expressing PfKelch13?-HA/ 
PfKelch13*°Y-HA in VPS34-mycl background were also checked for the 
expression by western blotting. Primary antibody either involved mouse anti- 
myc antibodies from Abcam (Cambridge, MA, USA) or rabbit anti-HA antibodies 
(Thermo Fisher Scientific, Grand Island, NY, USA), followed by HRP-conjugated 
secondary antibody from Bio-Rad (Hercules, CA, USA). Blots were developed 
using chemiluminescence detection kit. Antibodies to parasite PfFKBP or BiP 
were used as loading controls. 

Expression of SS-EEA1*!°744.mCherry was detected using Living Colours 
polyclonal rabbit anti-DsRed2 (Clontech, Mountain View, CA, USA). 

Expression of PfAKT-GFP in transgenic parasites was confirmed by western 

blotting of uninfected erythrocytes, non-transfected 3D7 parasites and transgenic 
(PfAKT-GFP) parasites, using rabbit anti-GFP antibodies from Thermo Fisher 
Scientific (Grand Island, NY, USA) and HRP-conjugated secondary goat anti- 
rabbit antibodies from Bio-Rad (Hercules, CA, USA). Antibodies to parasite 
PfFKBP and human band 3 were used as loading controls. 
Model building, docking and molecular dynamics (MD) simulations. The 
homology model of P. falciparum PfPI3K was build based on the structure of 
the class III PI3-kinase from Drosophila (PDB code 2X6F)”, using the multiple 
threading alignment in I-TASSER**** using additional information for the defini- 
tion of the active site from the structure of the human class III PI3K (PDB code 
3IHY). 

The known PI3K inhibitor, LY294002 (ref. 36) was docked to the model using 
GlideXP”’ to generate the starting structure for model refinement. The initial 
structure for the PI3K-dihydroartemisinin (DHA) complex was generated by 
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removing the LY294002 ligand from the refined structure and docking DHA to 
this model using GlideXP. 

This model of PfPI3K then served as the initial structure for 20 ns MD simula- 
tions. The model in these simulation consisted of the protein, surrounded by a 
periodic box of TIP3P** water molecules that extended 10 A from the protein. Na* 
counter ions were placed by the LEaP module of AMBER12” to neutralize the 
system. Ionizable residues were set to their normal ionization states at pH 7. 

The MD simulations were carried out using the PMEMD module of AMBER12 
(ref. 39). The protein was modelled using the ff03.r1 version of the AMBER force 
field while the DHA ligand was represented by the GAFF force field**". Atom- 
centred partial charges were generated based on B3LYP/6-31G* optimized geo- 
metries using RESP**’ as implemented in the antechamber module of AMBER12. 
A time step of 2 fs, combined with the SHAKE algorithm“ to constrain all bonds 
involving hydrogen atoms, was chosen. A non-bonded cutoff of 10.0 A was used, 
and the non-bonded pair list was updated every 25 time steps. 

The temperature (300K) and pressure (1atm) of the NPT ensemble were 
controlled by Langevin dynamics and isotropic position scaling* respectively. 
Long-range interactions were treated by the Particle-Mesh-Ewald (PME)***” 
methods with a grid spacing of ~1 A and a fourth-order B-spline interpolation 
to compute the potential and forces in between grid points. The trajectories were 
analysed using the PTRAJ module of AMBER12. The model of the PfPI3K-DHA 
complex was simulated analogously for a total time of 20 ns. 

The same MD protocol was used for the study of the complexes of artelinic acid, 
artemisone, and deoxyartemisinin in the Pf{PI3K model as well as DHA in VPS34 
and p110y. In short, initial structures were generated by docking of the small 
molecules to the PfPI3K model, human VPS34 (PDB code 3IHY), and human 
p1l0y (PDB code 4ANV). After manual inspection of the generated poses, the 
initial models were subjected to 20 ns MD simulations as described above. 
Assay for measuring cellular and immunoprecipitated PfAKT activity. 
P. falciparum 3D7 early-ring stage parasites were synchronized by sorbitol treat- 
ment and exposed either to mock treatment: 0.1% DMSO (control/mock) or 4nM 
DHA for 3h. An aliquot was washed using serum-free RPMI and returned to 
culture for 6h (washout). AKT activity was measured in mock-treated, 4nM 
DHA-treated and post-washed DHA treated cells (4X 10’) using the K-LISA 
Akt Activity Kit (CBA019) from Calbiochem (Darmstadt, Germany) following 
manufacturer’s instructions. Briefly, phosphorylation of AKT biotinylated peptide 
substrate was detected with phosphoserine antibodies. Serial dilutions of active 
human Aktl1/PKBa (Millipore) were used as standard and absorbance read at 
450nm on a plate reader. Mean (s.d.) from three experimental replicates, (each 
with triplicate data points) are shown. 

Endogenous PfAKT was also immunopurified from P. falciparum 3D7 follow- 

ing saponin treatment (as described earlier) and the resulting parasite pellet 
extracted with extraction buffer (25mM Tris/HCl pH 8.0, 150mM NaCl, 1% 
(v/v) Triton X-100, 1% (w/v) deoxycholate-Na, 2 mM EDTA containing complete 
protease inhibitor for 1h at 4°C. Pf{AKT was immunopurified using 10 ug of 
custom-generated antibodies (for 2 h at 4°C, under shaking conditions) followed 
by protein A agarose beads for 1 h. Protein A agarose beads incubated with parasite 
lysate in absence of antibodies served as negative control. Immunopurified 
PfAKT-beads and control beads were extensively washed in PBS, pH7.4 and 
processed further for the measurement of activity in the absence or presence of 
4nM DHA as described previously. Immunoprecipitation was also confirmed by 
western blotting using anti-PfAKT antibodies. 
Measurement of PI3P level in clinical and laboratory strains. Early ring stage 
parasites from P. falciparum 3D7 laboratory strains (non-transfected or trans- 
genic) and clinical strains were tightly synchronized with sorbitol treatment and 
lysed with saponin to remove haemoglobin. Resulting pellets (corresponding to 
6 X 10” parasites) were washed with PBS. Lipids were extracted and PI3P level 
assessed (in triplicates) using Echelon PI3P Mass ELISA Kit (K-3300) following 
the manufacturer’s instructions. 

PI3P level was also measured in 3D7 parasites under control condition, after 3 h 

treatment with 4nM DHA, as well as after 6 h in culture following DHA washout. 
Mean (s.d.) from three experimental replicates, (each with triplicate data points) 
are shown. 
Ring-stage survival assay (RSA). In vitro RSA was assessed as described by 
elsewhere”. Briefly, tightly synchronized early ring parasites (0-3 h post-invasion) 
were exposed to 700 nM DHA or 0.1% DMSO (control) for 6 h, washed thrice with 
serum-free RPMI and returned to culture for 66h. Blood smears were prepared 
and stained with Giemsa. Each sample was done in duplicate and 10,000 erythro- 
cytes were assessed independently by light microscope. Parasite survival rates 
(%RSA) were expressed as a percentage by comparing the number of viable para- 
sites between the drug-treated and untreated control. Mean (s.d.) of RSA based on 
four replicated carried out by two independent laboratory personnel. 
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Sample size. No statistical methods were used to predetermine sample size. 
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Extended Data Figure 1 | Ring Parasite PI3P and PfPI3K: effect of 
inhibitors, and characterization of antibodies. a, Rings at 6h post-invasion, 
were either mock treated or exposed to 4nM DHA for 4h. An aliquot was 
washed in serum-free RPMI and cultured for 24h. Parasitemia (ring and 
trophozoite stages) were determined by Giemsa staining and light microscopy. 
Mean (error bars s.d. <2) from three experimental replicates are shown. 

b, Effect of artemisinins, deoxyartemisinin, PI3K inhibitors wortmannin and 
LY294002 (and its inactive orthologue LY303511) on P13P level, as observed by 
redistribution of the PI3P reporter in transgenic 3D7 parasites expressing SS- 
EEA1“'-mCherry (red). Blue, parasite nucleus, P, parasite, E, erythrocyte; 
scale bar, 5 um. Fluorescence and phase-merged images are as indicated. 

c, Quantitative analysis of the data in b. Mean (error bars s.d. <21) from three 
experimental replicates are shown. d, Effect of anti-folates and chloroquine on 
PI3P production, as observed by redistribution of the PI3P reporter in 
transgenic 3D7 parasites expressing SS-EEA1'-mCherry (red). Blue, parasite 
nucleus, P, parasite, E, erythrocyte; scale bar, 5 um. Fluorescence and phase- 


merged images are as indicated. e, Quantitative analysis of the data in d. Mean 
(error bars s.d. <27) from three experimental replicates are shown. As 
indicated in Fig. 1, the PI3P reporter SS-EEA1’-mCherry (red) detects PI3P 
in punctate, perinuclear ER domains in 6h rings" (top panel) or reduced PI3P 
when the reporter diffuses to the PV (periphery). f, Schematic showing 
structural features of PfPI3K. Custom antibodies were generated in rabbits and 
guinea pigs against the indicated peptides. Western blots indicate that both 
antibodies specifically recognize a ~255 kDa protein in infected (IE) but not 
uninfected (U) erythrocytes and are representative of 10 experimental 
replicates; raw data in Supplementary Data 2. Indirect immunofluorescence 
assays using rabbit (top panel) and guinea pig (bottom panel) antibodies 
confirm the PfPI3K (green) is localized in the parasite. Exp-2 (red) marks the 
parasite boundary. Blue, parasite nucleus, P, parasite, E, erythrocyte; scale bar, 
5 tm. Fluorescence and phase-merged images are shown. Data are 
representative of 10 experimental replicates. 
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Extended Data Figure 2 | Structural analyses of P{PI3K. Sequence 
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Extended Data Figure 3 | Structural analyses of PfPI3K and human PI3K, 
with and without artemisinin derived compounds a Snapshots from 20 ns 
MD simulations and chemical structure of artelinic acid (top right), artemisone 
(bottom left) and deoxyartemisinin (bottom right) bound to PfPI3K. Snapshot 
from MD simulation of DHA bound to PfPI3K is shown (at the top left) for 
reference. Due to the lack of the hydroxyl group in artelinic acid and 
artemisone, no interactions involving D1889 and Y1915 were observed. 
Instead, the carboxylic group of artelinic acid forms a salt bridge with R1368 
and the sulfone moiety of artemisone interacts with R1368. Deoxyartemisinin 
lacks hydrogen bond donors, so D1889 is not involved in protein-ligand 
interaction. Instead, the carbonyl oxygen of deoxyartemisinin forms a 
hydrogen bond with Y1915. Boxed inset shows the change in the overall shape 
due to the replacement of the endoperoxide with an ether bridge in 
deoxyartemisinin (green) and compared to DHA (red), thus leading to loss of 
the shape complementarity and reorientation in the binding site. As shown in 
Fig. la, c, deoxyartemisinin failed to block PI3P production, in transgenic 3D7 
parasites expressing SS-EEA17-mCherry (red) or purified PfPI3K. Thus, 
modelling studies explain a lack of effect of deoxyartemisinin. They would also 
suggest lack of effect of artelinic acid and artemisone compounds on PfPI3K. It 
should be noted the target of these compounds is unknown and it remains 
unclear whether they would share a ring stage target with DHA important for 
clinical resistance to artemisinins. b, Snapshots of MD simulation of DHA- 
human VPS34 complex. The initial coordinates of DHA—human VPS34 
complex was obtained by the overlay of DHA...PfPI3K model (Extended Data 
Fig. 2b) with the human VPS34 crystal structure (PDB: 3IHY). The obtained 
complex was then subjected to 20 ns MD simulations. As can be seen, DHA 


LETTER 


does not interact with D644 and Y670, which correspond to D1889 and Y1915 
in PfPI3K, respectively. Instead, the hydroxyl group of DHA now interacts with 
D761 (left). Snapshot at the right shows an overlay of DHA...human VPS34 
complex with the PfPI3K model. In human VPS34 (grey), the loop that 
contains the D761 residue bends down to interact with DHA. In contrast, 
D2008 of Pf£PI3K (cyan), which is in the same position as D761 in human 
VPS34, has a different orientation which makes it unable to interact with DHA. 
c, Both D1889 and Y1915 in PfPI3K are conserved in human PI3K p110y. 20 ns 
MD simulations of the DHA ...p1107 complex (PDB code 4ANV), reveal that 
although the hydrogen bond between DHA and D841 still remains, the 
interaction between DHA and Y867 is lost. This suggests a weaker binding of 
DHA to p110y compared to PfPI3K. The same argument can be made of p1106 
based on sequence homology, but no crystal structure of human p1106 is 
available. d, Partial sequence alignment of PfPI3K, with human PI3K-C2a, 
PI3K-C2, p1106, p110y and VPS34. The key residues of PfPI3K (D1889 and 
Y1915) that interact with DHA and similar positions in the human kinases are 
coloured red. While D1889 in PfPI3K is conserved in both PI3K-C2« and 
PI3K-C2B, Y1915 in PfPI3K is replaced by F1172 and F1117 in PI3K-C2 and 
PI3K-C2f, respectively. Therefore, the hydrogen bond of the hydroxyl group in 
Y1915 suggested to be crucial in the computational analysis of Pf{PI3K cannot 
be formed with phenylalanine (F). AS far as we are aware, no crystal structures 
of PI3K-C2a and PI3K-C2f have been reported. In p1106, p110y and VPS34, a 
proline N-terminal to Y1915 is conserved. It is likely the reason for the different 
position and low flexibility of this loop observed in the MD simulations of 
p1106, and VPS34. 
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Extended Data Figure 4 | Transgenic P. falciparum expressing 
PfKelch13©°* mutation in two different strains of P. falciparum using 
distinct approaches and markers. a, Strategy 1: CRISPR-Cas9 used to 
introduce a single point mutation in the P. falciparum NF54 genome as 
described elsewhere”. Both parent and mutant strains were sequenced to 
ensure that the only change detected was PfKelch13“*°”. Western blots using 
anti-PfKelch13 antibodies confirm that mutation does not change the level of 
PfKelch13 protein. b, Strategy 2 expresses a dominant negative PfKelch13°°" 
and wild-type PfKelch13” tagged with HA, and driven by the constitutive cam 
promoter in P. falciparum 3D7. Western blots confirm that anti-HA antibodies 


p Strategy 2: Dominantnegative transgene 


Genomic Kelch13 NF54-C580Y 
TCATCAGCTATGTATGTT 


PfKelch13-C580 V4, 


PfKelch13“T-HA, 


3D7 
3D7 
3D7 


detected tagged forms of PfKelch13 in transfected but not non-transfected 3D7. 
Comparable levels of wild-type and mutant transgenes are expressed in 
resulting two strains of parasites (as shown). In a and b, BiP, an ER marker 
serves as a parasite loading control in western blots. Molecular weight standards 
(in kDa) are as indicated. c, Western blot indicate that antibodies to PfKelch13 
specifically recognize a ~83 kDa protein in infected (IE) but not uninfected 
erythrocytes (U). Molecular weight standards (in kDa) are as indicated. In 


a-c, data are representative of three experimental replicates; raw data are in 
Supplementary Data 2. 
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Extended Data Figure 5 | An in vitro ring-stage survival assay (RSA) level is 
associated with PI3P elevation in the presence and absence of Kelch 
mutations. a, Schematic of the RSA assay’. b, Summary of RSA, PI3P, 
PfKelch13 mutation and PfPI3K polymorphisms in clinical and laboratory 
strains used in this study. Means are shown for three experimental 

replicates. RSA s.d. <0.5; PI3P s.d. <8 (as shown by error bars in Fig. 3c). 

c, d, Construction of 3D7 transgenic parasites expressing myc- tagged forms 
of human VPS34, the mammalian orthologue of PfPI3K. The resulting 
pfVPS34+-mycl/? express the VPS34 transgene in ring stage parasites, as detected 
in indirect immunofluorescence assays (PfY?°**“77“N7 ™* data not shown in 
IFA). Exp-2 (red) marks the parasite boundary. Blue, parasite nucleus, P, 
parasite, E, erythrocyte; scale bar, 5 um. Fluorescent and merged images are 


shown. Whole genome Illumina sequencing indicated VPS34-mycl1 was 
inserted into PF3D7_1363900 and VPS34-myc2 is inserted in PF3D7_0718800. 
Both PF3D7_1363900 and PF3D7_0718800 encode for unknown function and 
neither has been reported in GWAS on artemisinin resistance. e, Construction 
of the strain 3D7-SS-EEA1*"*”*“-mCherry (used as transfection control) and 
western blot to confirm expression of SS- EEA1*'°”*“-mCherry (arrow). 
Molecular weight standards (in kDa) are as indicated. f, Construction of 
pfyPS34-mycl. PfKelch13*-HA and Pf’?S*4™Y*: PfKelch13°°"-HA parasites 
and western blots confirming expression of the transgenes. Raw data for 
western blots in e-f are shown in Supplementary Data 2. Molecular weight 
standards (in kDa) are as indicated. In d-f, data are representative of three 
experimental replicates. 
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P. falciparum 3D7 
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5’ hsp86 
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ITR1 


| Select with WR99210 


Transgenic parasites expressing Pf[AKT-GFP 


Extended Data Figure 6 | Evidence that the PI3K-AKT pathway is Supplementary Data 2. Data are representative of three experimental replicates. 
responsive to DHA. a, Western blot shows specificity of anti-PfAKT antibody __ b, Strategy and construct for generating transgenic 3D7 parasites expressing 
that recognizes a band of expected size in infected (IE) but not uninfected (U) | PfAKT-GFP. 

erythrocytes (1 X 10’ total cells loaded in each lane). Raw data are in 
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Extended Data Table 1| Percentage inhibition of mammalian 
kinases by 10nM DHA 


PRK2) 


s.d. <0.5. 
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Extended Data Table 2 | Primers used for cloning 


Primer 
Pfkelch 2AvrllF 


Pfkelich 1 3HAXhoIR 


C580Yreverse 
C580Y forward 
HsVps34AvrllF 
HsVps24mycXholR 


Vps34-742AAATA5F 
Vps34-742AAA7A5R 
AKTAvrll_F 
AKTBgIlI_R 
Kelch13_F 
Keloh12_R 
Keloh13-1 
Keloh12-2 
Keich12-3 
Kelch13-4 
Keloh12-5 
Kelch12-6 
PEPISK-I682T-F 
PEPISK-I682T-R 
PEPISK-O431K-F 
PIPISK-O431K-R 
PEPISK-Y 13200-F 
PEPISK-Y 1330C-F 


Sequence (5'-3'} 
AGATCTCCTAGGATGGAAGGAG AAAAA GTAAAAACAAAA GCAAATAG 


GGATCCCTCGAGTTAAGCATAATCT GGAACATCATAT GGA TAAGCAGCTGCTATATTTGCTATTAAA 
ACGGAGTGACCAAATCTGG 

TTATTATCAAAAGCAACATACATAGCT GAT GATCTAGG 
CCTAGATCATCAGCTATGTATGTTGCTTTT GATAATAA 
GGATCCCCTAGGATGGGGGAAGCAGAGAAGTTTCACTACATC 
GGATCCCTCGAGTTATAAATCTTICTTCAGATATTAATTTTI TGTTCAGCT GCTGCTTTTCTCCAGTACT 
GGGCAAACTTGTGAATC 

GGAGTTGGAGCTGCAGCACTGGATAACCTTTT GCTAACAAAAACAGGCAAACTC 
GTTATCCAGTGCTGCAGCTCCAACTCCAAGTATATAG GT GATCACGCAATATCC 
GTCCTAGGATGAATCATATAAACTITTTCAAATG 

GTAAGATCTTTITTGTTGACCT GAAAATTCATAATAG 
ATGGAAGGAGAAAAAGTAAAAACAAAAG 

TATATTTGCTATTAAAACGGAGTGACC 

ATGGAAGGAGAAAAAGTAAAAAC 

GATATGAGTGTATTAGATTCGAAC 

GATACTTATGAAAAGAAAATTATTG 

GAAAAAGAAAAATATTATCAAGAA 

CAACAATGCTGGCGTATGTGTACACC 

GATAATAAAATTTATGTCATTGGTGG 
CCTCATAGTACGAATAATACAAATCTCCAATCC 

ATATATTAATTCTTTAT TAAAAT TAAGGAAAT TCAAAATCAT GTATCC 
AACCTTTTCATATGATAACAATCCAAACAATATC 
TITTCTTTTTATATGTCTTATAATTGTAACGACTGG 
CCAAGAATAAAACAGACAGAATACAAAATAAAACC 


CTCATAGGAGTATCCAGCGAATTTTCC 
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Single-dose attenuated Vesiculovax vaccines protect 
primates against Ebola Makona virus 


Chad E. Mire'**, Demetrius Matassov**, Joan B. Geisbert'?, Theresa E. Latham’, Krystle N. Agans?, Rong Xu‘, Ayuko Ota-Setlik*, 
Michael A. Egan‘, Karla A. Fenton!?, David K. Clarke’, John H. Eldridge** & Thomas W. Geisbert!? 


The family Filoviridae contains three genera, Ebolavirus, 
Marburgvirus, and Cuevavirus'. Some members of the genus, 
including Zaire ebolavirus (ZEBOV), can cause lethal haemorrha- 
gic fever in humans. During 2014 an unprecedented ZEBOV out- 
break occurred in West Africa and is still ongoing, resulting in over 
10,000 deaths, and causing global concern of uncontrolled disease. 
To meet this challenge a rapid-acting vaccine is needed. Many 
vaccine approaches have shown promise in being able to protect 
nonhuman primates against ZEBOV’. In response to the current 
ZEBOV outbreak several of these vaccines have been fast tracked 
for human use. However, it is not known whether any of these 
vaccines can provide protection against the new outbreak 
Makona strain of ZEBOV. One of these approaches is a first-gen- 
eration recombinant vesicular stomatitis virus (rVSV)-based vac- 
cine expressing the ZEBOV glycoprotein (GP) (rVSV/ZEBOV). To 
address safety concerns associated with this vector, we developed 
two candidate, further-attenuated rVSV/ZEBOV vaccines. Both 
attenuated vaccines produced an approximately tenfold lower vac- 
cine-associated viraemia compared to the first-generation vaccine 
and both provided complete, single-dose protection of macaques 
from lethal challenge with the Makona outbreak strain of ZEBOV. 

Since discovery of the virus in 1976, outbreaks of ZEBOV have been 
detected sporadically in Africa. With increasing population growth the 
frequency of human contact with natural virus reservoirs’ is likely to 
rise, potentially leading to more catastrophic outbreaks such as the 
current epidemic in West Africa, thus increasing the need for effective 
antiviral strategies. A highly effective countermeasure would be a pre- 
ventive vaccine that can be simply and widely administered to people 
in regions of virus zoonosis and provide a ‘blanket immunity’ curtail- 
ing any future outbreaks. Also important will be the ability to rapidly 
combat deliberate misuse of these deadly viruses. Therefore, a prevent- 
ive vaccine should ideally confer rapid, single-dose protection. 

There are currently no licensed filovirus vaccines or post-expo- 
sure treatments available for human use. However, there are at 
least ten different vaccine approaches that have shown the poten- 
tial to protect nonhuman primates (NHPs) from lethal ZEBOV 
infection, including platforms based on recombinant adenovirus 
serotype 5 (rAd5) vectors, combined DNA/rAd5 vectors, com- 
bined rAd serotype 26 and 35 vectors, recombinant chimpanzee 
adenovirus serotype 3 (rChAd3) vectors, combined rChAd3 and 
modified vaccinia Ankara (MVA) vectors, virus-like particles 
(VLPs), alphavirus replicons, recombinant human parainfluenza 
virus 3 (rHPIV3), rabies virus, and recombinant vesicular stomat- 
itis virus (rVSV)’. Of the vaccines advancing to phase I trials, the 
rChAd3 and rVSV vectored vaccines have shown success in single- 
dose protection of NHPs against ZEBOV challenge; with the caveat 
that the rCchAd3/ZEBOV vaccine requires a boost with an MVA/ 
ZEBOV vector for protection past 6 months*. Also, NHPs inocu- 


lated with the rChAd3/ZEBOV vaccine were challenged with a 
ZEBOV seed stock containing a large virus population encoding 
8 uridines (U) ata critical transcription editing site in the GP gene’. 
This specific genetic feature typically arises following prolonged 
passage of ZEBOV in Vero E6 cells and results in higher levels of 
expression of full-length GP. In contrast, low-passage ZEBOV iso- 
lates retain 7U at the GP editing site, resulting in higher levels of 
secreted GP (sGP) expression, which is associated with greater 
viral virulence’. Importantly, studies have shown that rAd-based 
ZEBOV vaccines that completely protect NHPs against ZEBOV 
stocks containing high populations of 8U virus are not able to 
completely protect vaccinated macaques challenged with ZEBOV 
stocks containing high populations of 7U virus*. 

The first generation rVSV/ZEBOV vaccine that replaces the VSV 
glycoprotein G with the ZEBOV GP (rVSV/ZEBOVAG), originally 
developed by Drs Feldmann and Geisbert? and currently licensed by 
Merck, has demonstrated solid single-dose NHP protection against a 
low-passage 7U ZEBOV stock®. The rVSV/ZEBOVAG vector has also 
protected 50% of NHPs when administered shortly after ZEBOV chal- 
lenge’®, and has demonstrated safety in a NHP neurovirulence 
model". However, there is a robust post-vaccination viraemia in 
macaques and a recent phase I trial of the rVSV/ZEBOVAG vaccine 
in Geneva was halted due to temporary joint pain in some patients. 
The level of vaccine-associated viraemia and frequency of adverse 
events will be more fully documented as data from ongoing phase 3 
trials become available for this vector; but the early observation suggest 
that a further-attenuated rVSV vector may be more desirable for wide- 
spread administration in endemic regions of Africa. 

To address this possible safety concern we have developed and 
tested two further-attenuated rVSV/ZEBOV vaccine candidates for 
efficacy. One of these vaccines is based on an rVSV vector that has 
advanced through clinical evaluation. It was attenuated by translocat- 
ing the VSV nucleoprotein (N) gene from position 1 to position 4 in 
the genome (N4) and truncating the cytoplasmic tail (CT) of the VSV 
G protein from 29 to 1 amino acids (CT1)””. This rVSVN4CT1 vector 
was modified to maximally express HIV-1 gag from position 1 in the 
genome (rVSVN4CT 1gag1) by positioning the gag gene immediately 
adjacent to the single strong 3’ VSV transcription promoter. The 
rVSVN4CT lgagl vector has demonstrated safety in mouse and 
NHP neurovirulence studies'*"’, and replication is restricted to the 
IM inoculation site and draining lymph node following vaccination 
of mice’*. The rVSVN4CT 1gagl vector has demonstrated safety and 
immunogenicity in two phase I clinical trials (HVTN 090 and HVTN 
087: http://clinicaltrials.gov/) and no post-vaccination viraemia was 
detected in urine, saliva, and blood of vaccine recipients. The 
rVSVN4CTI1GPI1 vector described here (Fig. 1a, N4) is analogous in 
design to that of the rVSVN4CT1gagl vaccine and expresses ZEBOV 
GP from genome position 1. The other attenuated rVSV/ZEBOV 


1Galveston National Laboratory, University of Texas Medical Branch, Galveston, Texas 77555, USA. *Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas 
77555, USA. *Department of Virology and Vaccine Vectors, Profectus BioSciences, Inc., Tarrytown, New York 10591, USA. “Department of Immunology, Profectus BioSciences, Inc., Tarrytown, New York 


10591, USA. 
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Figure 1 | rVSV/ZEBOV vector design, growth kinetics and vaccine study strategy. a, Genome organization comparing ZEBOV GP (Mayinga strain)- 
expressing rVSV vectors as described in methods. The rVSV/ZEBOVAG (AG) vector had the natural VSV G gene replaced with the ZEBOV GP at position 4 
within the genome. rVSVN1CT1GP3 (N1) vector retained the position of VSV N in position 1 (orange box), insertion of ZEBOV GP at position 3 and a truncated 
form of VSV G containing the CT1 truncation was inserted at position 6. The rVSVN4CT1GP1 (N4) vector had the insertion of ZEBOV GP in position 1, 
attenuating N gene translocation (N4) (black box) and truncated G protein cytoplasmic tail (CT1). Numbers above vector constructs designate genome positions. 
Virus leader (Le), trailer (Tr), and intergenic regions are shown in black. Shaded regions represent deleted amino acid regions. b, Single-cycle growth kinetics 
comparing the AG, N1, and N4 vectors depicted in a. Data shown are mean + s.d. from two biological replicates titrated by plaque assay in triplicate. Titre 
differences between AG and N1 vectors were statistically significant at 4 (P = 0.0001), 12 (P = 0.0055), and 24h post infection (P = 0.0001). Likewise, AG and N4 
vector titres were significantly different at 4 (P = 0.0001), 12 (P = 0.0005), 24 (P = 0.0001), and 48h post infection (P = 0.0068). Unpaired t-test, P = 0.05. 

c, Crystal violet-stained Vero cell monolayers showing plaques generated by the AG, N1, and N4 vectors at 48 h post infection. d, Flow chart showing the day of 
vaccination (triangles), days of sampling (arrows), day of challenge (*). Blue triangle, unvaccinated cohort; orange triangle, N1-vaccinated cohort; black triangle, 
N4-vaccinated cohort. 


Hours post infection 


vaccine described here (rVSVN1CT1GP3), expressing a truncated 
form of VSV G, was designed to be of intermediate attenuation 
between rVSVN4CTIGP1 and the first generation rVSV/ 
ZEBOVAG vaccine (Fig. 1a, N1). Both attenuated rVSV/ZEBOV 
vectors express GP from the ZEBOV Mayinga strain, as do most 
other candidate ZEBOV vaccines currently under evaluation. 
Sequence homology between GPs from the new West African 
Makona strains analysed to date and the 1976 Mayinga strain is 
approximately 97%. Although this difference is not likely to affect 
the protective efficacy of the current ZEBOV vaccines against the 


Table 1 | Clinical findings for NHPs challenged with ZEBOV-Guinea 


heterologous West African strains, it is possible that small changes 
in sequence could lead to reduced efficacy of a vaccine". It is well 
established that small variations in sequence and even single amino 
acid changes in sequence for other viruses including influenza, 
respiratory syncytial virus, polio, equine infectious anaemia virus, 
and SIV can reduce vaccine efficacy. Here we assessed the ability of 
our newly developed next-generation rVSV-based vaccines expres- 
sing ZEBOV Mayinga GP to protect against heterologous challenge 
with the new outbreak Makona strain of ZEBOV in cynomolgus 
monkeys. 


Animal Vaccine Day —26* PRNTsot Clinical signs observed} Final outcome 
129 N/A = 0/0 Fever (6), anorexia (5-8), depression (6-8), mild rash Expired day 8 
(6-8), lymphopenia (3, 6), thrombocytopenia (6), 
ALT->(6), ALP>—> — (6), AST>->-— (6), GGT>—- (6), 
CRP increase (6) 
276 N/A = 0/0 Fever (6), anorexia (6-7), depression (6-7), mild rash Expired day 7 
(6-7), thrombocytopenia (6, 10), ALT-(6), 
ALP-+—- (6), AST>->—= (6), GGT (6), CRP 
increase (6) 
0910078 N1 + 0/40 O8 Survived 
1001100 Nl = 0/160 CRP increase (6) Survived 
117 N1 i 0/80 Lymphopenia (6), CRP increase (6, 10) Survived 
0907095 N1 — 0/160 Lymphopenia (6, 10), CRP increase (6, 10), Survived 
ALT—-(6), ALP-+(6), AST>—(6) 
0807174 N4 + 0/160 Lymphopenia (6), CRP increase (6, 10) Survived 
0901014 N4 - 0/80 @ Survived 
119 N4 + 0/80 @ Survived 
0811013 N4 + 0/20 © Survived 


*rVSV viraemia 2 days post vaccination. —, below limit of detection (25 PFU per ml); +, up to 3 X10 PFU per ml. 

+ 50% plaque reduction neutralization titre at day of challenge and terminal day presented as day of challenge/terminal day. 

¢ Days after ZEBOV challenge are in parentheses. Fever is defined as a temperature more than 1.4 °C above baseline or at least 0.9°C above baseline and = 39.7°C. Lymphopenia and thrombocytopenia are defined 
by a = 35% drop in numbers of lymphocytes and platelets, respectively. ALT, alanine aminotransferase; ALP, alkaline phosphatase; AST, aspartate aminotransferase; GGT, gamma glutamyltransferase. CRP, 


C-reactive protein: two- to threefold increase,>; 4- to fivefold increase, +; > 5 fold increase, =—. 
§ No symptoms observed. 
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Results from an in vitro growth kinetics study (Fig. 1b) indicate an 
approximate tenfold reduction in growth rate early in infection for 
rVSVN4CTIGPI1 relative to rVSV/ZEBOVAG. Also noted during 
virus plaque assay were the larger more rapidly forming plaques gen- 
erated by rVSV/ZEBOVAG compared to rVSVN4CTIGP1, with 
rVSVNICTI1GP3 showing intermediate growth and plaque size 
(Fig. 1c). 

We next tested if the further-attenuated rVSV/ZEBOV vaccines 
could provide NHPs with single-dose protection against challenge 
with ZEBOV isolated from the current outbreak in Guinea’’. Groups 
of four cynomolgus macaques were inoculated intramuscularly with 
2X 10’ plaque-forming units (PFU) of either rVSVN4CTIGP1 or 
rVSVN1CT1GP3; a group of two control macaques were unvaccinated 
(Fig. 1d, arrow heads). None of the macaques showed any sign of illness 
or distress following vaccine administration. Consistent with the stat- 
istically significant growth differences between rVSV/ZEBOVAG and 
the more attenuated vectors seen during in vitro growth kinetics stud- 
ies, levels of both attenuated vaccine viruses detected in the blood of 
vaccinated macaques (500 PFU per ml) were 10- to 50-fold lower than 
those detected for the more replication competent rVSV/ZEBOVAG? 
(Table 1, day —26). The ZEBOV GP-specific humoral immune res- 
ponse was assessed for all animals before vaccination (Fig. 2a, —28) and 
after vaccination (Fig. 2a, —18 and 0) by IgG capture ELISA and neut- 
ralizing antibody titres (Table 1, plaque 50% reduction neutralization 
test (PRNT)s0). Results showed neutralizing titres at terminal days for 
vaccinated cohorts and detectable circulating levels of anti-ZEBOV GP 
IgG for both vaccine cohorts after vaccination and before challenge 
with no detectable levels for the unvaccinated control animals 
(Fig. 2a). A cell-mediated immune response was also detected in all 
vaccinated animals by ZEBOV GP-specific interferon gamma (IFN-y) 
ELISpot assay 10 days after vaccination (Extended Data Fig. 1a and b). 

The eight vaccinated and two unvaccinated control macaques were 
challenged by intramuscular injection with 1,000 PFU ofa low passage 
100% 7U Makona strain stock of ZEBOV"* 28 days after the single 
injection vaccination (Fig. 1d, asterisk). None of the animals 


51,200 [=a 0910078 (N1) 
@ 25,600 TT 1001100 (N1) 
= me 117 (N1) 
a % 0907095 (N1) 
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Figure 2 | N1 and N4 vaccination results in circulating anti-ZEBOV GP IgG 
and protection in cynomolgus macaques. a, Reciprocal endpoint dilution titres 
for circulating IgG against ZEBOV GP for control (blue), N1 (orange), and N4 
cohorts (black-grey) on day of vaccination (—28), 10 days post vaccination (—18), 
and on day of challenge (0). Red dashed line depicts limit of detection for ELISA 
assay. Error bars represent s.e.m. b, Kaplan-Meier survival curve for each cohort 
post ZEBOV challenge. ¢, Circulating infectious virus load displayed as plaque 
forming units per ml. Data shown are from individual animals. Lower limit of 
detection is 25 PFU per ml. 
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vaccinated with either of the two further-attenuated rVSV/ZEBOV 
vectors showed any severe signs of illness following challenge with 
ZEBOV (Table 1), whereas the two unvaccinated control macaques 
succumbed to disease on days 7 and 8 (Fig. 2b). Circulating infectious 
ZEBOV was isolated from both of the unvaccinated control macaques 
on days 3 and 6 post challenge (Fig. 2c, blue) but no circulating 
infectious ZEBOV could be detected in any of the vaccinated animals. 
Examination of tissues by immunohistochemistry showed abundant 
ZEBOV antigen in tissues of the unvaccinated control animals (129 
and 276) (Fig. 3a-d) whereas ZEBOV antigen was not detected in 
tissues of the macaques vaccinated with rVSVN1CT1GP3 (1001100) 
or rVSVN4CT1GP1 (0807174) (Fig. 3e-h). 

Here we show protection against a new West African Makona strain 
of ZEBOV using a novel filovirus vaccine platform. The large reduc- 
tion in vaccine-associated viraemia indicates a significant increase of in 
vivo attenuation for these next-generation rVSV/ZEBOV vaccine vec- 
tors, which should translate into greater safety and reduced adverse 


Liver 


Spleen 


129 


276 


Figure 3 | Comparison of ZEBOV antigen in tissues of cynomolgus 
macaques either vaccinated or unvaccinated. a, c, Liver, diffuse cytoplasmic 
immunolabelling (brown) of sinusoidal lining cells in both ZEBOV-infected 
control animals. b, d, Spleen, diffuse cytoplasmic immunolabelling of 
dendriform mononuclear cells in the red and white pulp of ZEBOV-infected 
control animals. e, f, Liver and spleen, respectively, with a lack of 
immunolabelling from N1 cohort animal 0910078. g, h, Liver and spleen, 
respectively, with a lack of immunolabelling from N4 cohort animal 0807174. 
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events in humans. Importantly, single-dose vaccination of NHPs with 
highly attenuated forms of rVSV expressing ZEBOV Mayinga GP 
provides complete protection from heterologous challenge with a 
highly virulent 7U ZEBOV isolated early during the current West 
African outbreak'®. ZEBOV genome sequencing from cases later dur- 
ing the West Africa outbreak has revealed little drift in the GP gene’”"*, 
suggesting that this vaccine platform could also be efficacious against 
currently circulating ZEBOV. These findings pave the way for the 
identification and manufacture of safer, single-dose, high efficacy vac- 
cine(s) to combat current and future filovirus outbreaks in Africa and 
their potential use as biological weapons. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Generation of N4 and N1 ZEBOV vectors. As described previously'*”” an 
rVSV;NN4CT1gag1 vector (Indian serotype) expressing HIV-1 gag was used as 
the backbone for generating the attenuated rVSVN4CT1 vector expressing the 
Zaire ebolavirus (ZEBOV) glycoprotein (GP). The corresponding 
rVSVjNN4CT1gagl genomic cDNA was modified by exchanging the gag gene 
expression cassette via Xhol/Notl restriction sites with an expression cassette 
encoding a full length ZEBOV GP [1976, Mayinga strain], generating the 
rVSViNN4CT1-ZEBOVGP1 cDNA (Fig. 1a, N4). The N1 vector was generated 
by first inserting ZEBOV GP into a VSV-N1AG backbone via Xhol/Not! restric- 
tion sites within a transcriptional cassette located at position 3 in the genome; 
followed by the insertion of a PCR fragment containing a portion of VSV L, a 
modified VSV G CT1 gene and trailer into the N1 genome at position 6 via the 
HindIIlI/RsrII sites, generating the rVSV;,N1(G CT1)6-ZEBOVGP3 cDNA 
(Fig. 1a, N1). 

The rVSV-ZEBOV vectors were rescued from genomic cDNA as previously 
described”’. Rescued virus was plaque purified and amplified on Vero cell mono- 
layers (ATCC, CCL-81). For animal studies, virus vectors were purified from 
infected BHK-21(ATCC CCL-10) cell supernatants by centrifugation through a 
10% sucrose cushion. Purified virus was resuspended in PBS, pH 7.0, mixed with a 
sucrose phosphate (SP) stabilizer (7 mM K,HPO,, 4mM KH>PO,, 218 mM suc- 
rose), snap frozen in ethanol/dry ice and stored at —80 °C until ready for use. 
Growth kinetics study of AG, N4 and N1 ZEBOV vectors. Single-step growth 
curves were performed by adsorbing the N4, N1 and a AG control virus to duplic- 
ate monolayers of Vero cells (ATCC, CCL-81) in six-well plates at a multiplicity of 
infection (MOI) of 10 for 15 min at room temperature with continued rocking 
followed by incubation at 37°C with 5% CO, for 30min without 
agitation. The inoculum was aspirated, the cells washed 3X with serum-free 
Dulbecco’s minimal Eagle’s medium (DMEM) and then DMEM containing 
5% fetal bovine serum (FBS) was added to the plates, which were placed at 
32°C with 5% CO. Samples for titration were taken at 4, 8, 12, 16, 24 and 48h 
post infection and replaced with the same volume of fresh media. Virus titres were 
determined in duplicate by plaque assay on Vero cells. Growth curves were per- 
formed in triplicate for each virus. Plaque images for each vector were taken at 48 h 
post infection, after staining with a 1% crystal violet solution. Statistical analysis of 
rVSV titres were performed using unpaired t-test with a 95% confidence level 
(P <0.05) with the GraphPad Prism program. 

Challenge virus. The ZEBOV Makona strain seed stock originated from serum 
from a fatal case early during the 2014 outbreak in Guékédou, Guinea'® (NCBI 
accession number KJ660347) and was passaged twice in Vero E6 cells (ATCC, 
CRL-1586). The virus stock was deep sequenced as 100% 7U at the GP editing site 
in the viral genome (see below). 

Deep sequencing. Approximately 1 ml of the ZEBOV Makona strain seed stock 
was removed from the seed stock vial and placed in 5 ml of TRIzol LS and vortexed 
three times and allowed to sit for 10min. The 6 ml were then placed into two 
separate 3 ml Nunc cryo-vials for removal from the BSL-4. RNA was isolated from 
the TRIzol LS/sample mixture using Zymo Research Direct-zol RNA mini-prep 
per manufacturer’s instructions. Approximately 150 ng of purified RNA were used 
to make cDNA using the NuGen Ovation RNA-seq 2.0 kit ultimately for the 
preparation of the double-stranded DNA library using Encore Ion Torrent library 
prep kit. Sequencing was performed by the UTMB Molecular Core on the Ion 
Torrent using 318-v2 deep sequencing chips. Sequence analysis was performed 
using DNA Star Seqman NGen software based on paired-end analysis of 100 base 
pairs overlaps. 

Vaccination and animal challenge. Ten, healthy, filovirus-naive, adult (~3 to 
9.5kg, 7 female and 3 male), Chinese origin cynomolgus macaques (Macaca 
fascicularis) were randomized with Microsoft Excel into two experiment groups 
of four animals each and a control group of two animals. Animals in one experi- 
mental group were vaccinated by intramuscular injection of approximately 
2 X 10’ PFU of the rVSVN4CT1GP vaccine while animals in the other experi- 
mental group were vaccinated with approximately 2 X 10’ PFU of the VSV- 
N1CT1 ZEBOVGP vaccine. The two control animals were not vaccinated. Four 
weeks after the single injection vaccination all ten animals were challenged by 
intramuscular injection with 1,000 PFU of the ZEBOV Makona strain virus. All 
animals were given physical exams and blood was collected before vaccination, at 
day 10 after vaccination, at the time of ZEBOV challenge and on days 3, 6, 10, 14, 
21 and 28 after ZEBOV challenge (Fig. 1d, arrows). Animals were monitored daily 
and scored for disease progression with an internal filovirus scoring protocol 
approved by the UTMB Institutional Animal Care and Use Committee. The 
scoring changes measured from baseline included posture/activity level, atti- 
tude/behaviour, food and water intake, weight, respiration, and disease manifesta- 


tions such as visible rash, haemorrhage, ecchymosis, or flushed skin. A score of = 9 
indicated that an animal met criteria for euthanasia. This study was not blinded. 
Anti-ZEBOV GP IgG ELISA. Serum collected at indicated time points was tested 
for immunoglobulin G (IgG) antibodies against ZEBOV. Enzyme-linked immu- 
nosorbent assay (ELISA) using recombinant ZEBOV GPdTM purified protein 
(Integrated BioTherapeutics, Inc.) was used to detect cross-reactive IgG. 
ZEBOV GPdTM was diluted to an optimal working concentration of 100 ng per 
well in 0.1 ml carbonate/bicarbonate buffer (carbonate/bicarbonate buffer with 
azide tablets from Sigma catalogue number 08058-50TAB-F) and used to coat 
Immulon 2HB flat bottom ELISA plates (Thermo Labsystem catalogue number 
3455) for 18h at 4 °C. Coated plates were blocked (10% FBS + 1X PBS) for at least 
2h. The serum samples were assayed at twofold dilutions starting at a 1:100 
dilution in ELISA diluent (1% heat inactivated fetal bovine serum (HI-FBS), 
1X PBS, and 0.2% Tween-20). Samples were incubated for 1h at room temper- 
ature, removed, and plates were washed. Wells were then incubated at room 
temperature for 1h with anti-monkey IgG conjugated to horseradish peroxidase 
(Fitzgerald Industries International) at a 1:2,500 dilution. These wells were washed 
and then incubated with 2,2’-azine-di(3ethylbenzthiazoline-6-sulfonate) peroxi- 
dase substrate system (KPL) at room temperature for approximately 10 min. 
Reaction was stopped with 1% SDS and read for dilution endpoints at 405 nm 
ona microplate reader (Molecular Devices Emax system). Absorbance values were 
normalized by subtraction of background A4o5nm from uncoated wells for each 
serum dilution. Antigen-specific serum IgG end-point titres were defined as the 
reciprocal of the last normalized serum dilution giving an A4o5 nm greater than 0.1. 
ZEBOV neutralization assay. Neutralization assays were performed by mea- 
suring plaque reduction in a constant virus:serum dilution format as previously 
described”. Briefly, a standard amount of ZEBOV (~ 100 PFU) was incubated with 
serial twofold dilutions of the serum sample for 60 min. The mixture was used to 
inoculate Vero E6 cells (ATCC, CRL-1586) for 60 min. Cells were overlaid with an 
agar medium, incubated for 7 days, and plaques were counted 48h after neutral 
red staining. Endpoint titres were determined by the dilution of serum, which 
neutralized 50% of the plaques (PRNTs9). 

IFN-y ELISpot assay. Ninety-six-well flat-bottomed ELISpot plates (Millipore) 
were coated overnight with a mouse anti-human IFN-y monoclonal antibody 
(clone 27; BD-Pharmingen) at a concentration of 10 1g ml, after which the 
plates were washed three times with 1X PBS and then blocked for 2h with PBS 
containing 5% heat-inactivated FBS. Heparinized whole blood was collected 
10 days after immunization of macaques, and peripheral blood mononuclear cells 
(PBMCs) were isolated from whole blood by Ficoll-Hypaque density gradient 
centrifugation, and resuspended in complete RO5 culture medium. The isolated 
macaque PBMCs were washed once with complete RO5 culture medium and 
resuspended in complete ROS culture medium containing either 5 ugml_' phy- 
tohaemagglutinin mucoprotein (Sigma), peptide pools (15-mers overlapping by 
11 amino acids; final peptide concentration, 1 1M each) spanning the ZEBOV 
Mayinga strain GP, or medium alone. The input cell number was 2 X 10° PBMCs 
per well (2 X 10° PBMCs per ml), and cells were assayed in duplicate wells. Cells 
were incubated for 18 to 24h at 37 °C and then removed from the ELISpot plate by 
first being washed with deionized water and then being washed six times with 1X 
PBS containing 0.25% Tween 20. Thereafter, plates were treated with a rabbit 
polyclonal anti-human IFN-y biotinylated detection antibody (0.65 1g per well; 
Life Technologies) diluted with 1x PBS containing 1% bovine serum albumin 
(BSA) and were incubated at 37°C for 2h. ELISpot plates were then washed 6 
times with 1X PBS containing 0.25% Tween 20, treated with 100 pl per well of 
streptavidin-horseradish peroxidase conjugate (BD Biosciences) diluted 1:250 
with 1X PBS containing 10% FBS and 0.005% Tween 20, and incubated for an 
additional 1h at room temperature. Unbound conjugate was removed by rinsing 
the plate six times with 1x PBS containing 0.25% Tween 20 and three times with 
1X PBS. A chromogenic substrate (100 pl per well) (one-step nitroblue tetrazo- 
lium/5-bromo-4-chloro-3-indolylphosphate (NBT/BCIP); Pierce) was then added 
for 3 to 5 min before being rinsed away with water, after which the plates were air 
dried and the resulting spots counted using an ImmunoSpot reader (CTL Inc.). 
Peptide-specific IFN-y ELISpot responses were considered positive if the res- 
ponses (minus the medium background) were threefold above the medium res- 
ponse and 50 spot-forming cells (SFC) per 10° PBMCs. Unpaired t-test analysis of 
IFN-y ELISpot data was performed on GraphPad Prism version 5.02 software. 
Two-tailed P values less than 0.05 indicated that the tests were statistically signifi- 
cant. 

Detection of viraemia. Virus titration of the rVSV vaccine vectors and ZEBOV 
was performed by plaque assay with Vero E6 cells (ATCC, CRL-1586) from cell 
culture or serum samples as previously described’. Briefly, increasing tenfold 
dilutions of the samples were adsorbed to Vero E6 monolayers in duplicate wells 
(200 jl); the limit of detection was 25 PFU per ml. 
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Haematology and serum biochemistry. Total white blood cell counts, white 
blood cell differentials, red blood cell counts, platelet counts, haematocrit values, 
total haemoglobin concentrations, mean cell volumes, mean corpuscular volumes, 
and mean corpuscular haemoglobin concentrations were analysed from blood 
collected in tubes containing EDTA using a laser based haematologic analyser 
(Beckman Coulter). Serum samples were tested for concentrations of albumin, 
amylase, alanine aminotransferase, aspartate aminotransferase, alkaline phospha- 
tase, gamma-glutamyltransferase, glucose, cholesterol, total protein, total biliru- 
bin, blood urea nitrogen, creatine, and C-reactive protein by using a Piccolo point- 
of-care analyser and Biochemistry Panel Plus analyser discs (Abaxis). 

Histopathology and immunohistochemistry. Necropsy was performed on all 
subjects. Tissue samples of all major organs were collected for histopathological 
and immunohistochemical examination, immersion-fixed in 10% neutral buffered 
formalin, and processed for histopathology as previously described*!. For immuno- 
histochemistry, specific anti-ZEBOV immunoreactivity was detected using an anti- 
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ZEBOV VP40 protein rabbit primary antibody (Integrated BioTherapeutics, Inc.) at 
a 1:4,000 dilution. In brief, tissue sections were processed for immunohistochemistry 
using the Dako Autostainer (Dako). Secondary antibody used was biotinylated goat 
anti-rabbit IgG (Vector Laboratories) at 1:200 followed by Dako LSAB2 streptavidin- 
horseradish peroxidase (Dako). Slides were developed with Dako DAB chromagen 
(Dako) and counterstained with haematoxylin. Non-immune rabbit IgG was used as 
a negative control. 


19. Cooper, D. et al. Attenuation of recombinant vesicular stomatitis virus HIV-1 
vaccine vectors by gene translocations and G gene truncation reduces 
neurovirulence and enhances immunogenicity in mice. J. Virol. 82, 207-219 
(2008). Medline CrossRef. 

20. Witko, S. E. etal. An efficient helper-virus-free method for rescue of recombinant 
paramyxoviruses and rhadoviruses from a cell line suitable for vaccine 
development. J. Virol. Methods 135, 91-101 (2006). Medline CrossRef. 

21. Thi, E. P. etal. Marburg virus infection in nonhuman primates: Therapeutic 
treatment by lipid-encapsulated siRNA. Sci. Transl. Med. 6, 250ra116 (2014). 
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Mutant MHC class II epitopes drive therapeutic 
immune responses to cancer 


Sebastian Kreiter', Mathias Vormehr*, Niels van de Roemer2*, Mustafa Diken', Martin Lower’, Jan Diekmann!’, 
Sebastian Boegel', Barbara Schrors', Fulvia Vascotto!, John C. Castle’, Arbel D. Tadmor!, Stephen P. Schoenberger‘, 


Christoph Huber’, Ozlem Tiireci'§ & Ugur Sahin'**8 


Tumour-specific mutations are ideal targets for cancer immuno- 
therapy as they lack expression in healthy tissues and can poten- 
tially be recognized as neo-antigens by the mature T-cell repertoire. 
Their systematic targeting by vaccine approaches, however, has 
been hampered by the fact that every patient’s tumour possesses 
a unique set of mutations (‘the mutanome’) that must first 
be identified. Recently, we proposed a personalized immuno- 
therapy approach to target the full spectrum of a patient’s 
individual tumour-specific mutations’. Here we show in three 
independent murine tumour models that a considerable fraction 
of non-synonymous cancer mutations is immunogenic and that, 
unexpectedly, the majority of the immunogenic mutanome 
is recognized by CD4* T cells. Vaccination with such CD4* 
immunogenic mutations confers strong antitumour activity. 
Encouraged by these findings, we established a process by which 
mutations identified by exome sequencing could be selected as 
vaccine targets solely through bioinformatic prioritization on 
the basis of their expression levels and major histocompatibility 
complex (MHC) class II-binding capacity for rapid production as 
synthetic poly-neo-epitope messenger RNA vaccines. We show 
that vaccination with such polytope mRNA vaccines induces 
potent tumour control and complete rejection of established 
aggressively growing tumours in mice. Moreover, we demonstrate 
that CD4* T cell neo-epitope vaccination reshapes the tumour 
microenvironment and induces cytotoxic T lymphocyte responses 
against an independent immunodominant antigen in mice, indi- 
cating orchestration of antigen spread. Finally, we demonstrate an 
abundance of mutations predicted to bind to MHC class II in 
human cancers as well by employing the same predictive algorithm 
on corresponding human cancer types. Thus, the tailored immuno- 
therapy approach introduced here may be regarded as a universally 
applicable blueprint for comprehensive exploitation of the substan- 
tial neo-epitope target repertoire of cancers, enabling the effective 
targeting of every patient’s tumour with vaccines produced ‘just 
in time’. 

We recently reported comprehensive mapping of non-synonymous 
mutations of the B16F10 tumour by next-generation sequencing 
(Fig. 1a)’. Tumour-bearing C57BL/6 mice were immunized with syn- 
thetic 27mer peptides encoding the mutated epitope (mutation in 
position 14), resulting in T-cell responses which conferred in vivo 
tumour control. In continuation of that work, we now characterized 
the T-cell responses against the neo-epitopes, starting with those with a 
high likelihood of MHC I binding. Mice were vaccinated with syn- 
thetic 27mer peptides (Fig. 1b). Immunogenic mutations were iden- 
tified by IFN-y ELISpot of splenocytes and T-cell subtype was 
determined by cytokine release assay (Fig. 1a). About 30% of neo- 
epitopes were found to induce mutation-reactive cytokine-secreting 


T cells. Surprisingly, responses against nearly all mutated epitopes 
(16/17, 95%) were CD4* (Fig. 1b, Extended Data Table 1). 

To exclude bias associated with the peptide format, this experiment 
was repeated using in vitro transcribed (IVT) mRNA encoding the 
neo-epitopes. Also in this setting the majority of mutation-specific 
immune responses (10/12, ~80%) were conferred by CD4* T cells 
(Extended Data Fig. 1, Extended Data Table 1). 

Recently, we identified over 1,680 non-synonymous mutations? in 
the colon carcinoma model CT26 in BALB/c mice. We selected 48 of 
these mutations in analogy to the B16F10 study based on good MHC 
class I binding (‘low score’ 0.1-2.1). The other half was deliberately 
chosen for poor binding (‘high score’ >3.9). In total, about 20% of 
mutated epitopes were found to be immunogenic in mice immunized 
with the respective RNA monotopes (Fig. 1c, Extended Data Table 2). 
In the ‘low’ MHC I score subgroup, but not in the ‘high’ score sub- 
group, several epitopes inducing CD8” T cells were identified (Fig. 1c 
right). MHC class II-restricted epitopes were represented in similar 
frequency in both subgroups, constituting the majority of CT26 
immunogenic mutations (16/21, 80%). 

Similarly, in the 4T1 mammary carcinoma model about 70% of the 
immunogenic epitopes determined by RNA monotope vaccines repre- 
senting all 38 mutations of this model were recognized by CD4* T cells 
(Fig. 1d, Extended Data Table 3). In summary, we showed that in three 
independent mouse tumour models with different MHC backgrounds, 
a considerable fraction of non-synonymous cancer mutations is 
immunogenic and quite unexpectedly, the immunogenic mutanome 
is predominantly recognized by CD4* T cells. 

To investigate whether MHC class II-restricted cancer mutations 
are good vaccine targets in vivo, we engineered pharmacologically 
optimized RNA (stabilizing elements in RNA sequence and liposomal 
formulation)*° encoding B16-M30, one of the epitopes that elicited 
strong CD4* T-cell responses in the B16F10 tumour model. The 
mutated amino acid was essential for T-cell recognition, hence the 
wild type peptide was not recognized (Extended Data Fig. 2a). When 
B16F10 tumour-bearing C57BL/6 mice were repeatedly vaccinated 
with the B16-M30 RNA monotope, tumour growth was profoundly 
retarded (Fig. 2a). About two thirds of the neo-epitope RNA treated 
mice were still alive at day 100, while all the control RNA treated mice 
had died by 65 days. Depletion experiments in B16-M30 RNA treated 
mice revealed involvement of CD4* but not CD8™ T-cells for thera- 
peutic antitumour efficacy of the neo-epitope (Fig. 2a right). 

Similarly, lung metastases of luciferase transduced B16F10 cells 
were efficiently eradicated with B16-M30 RNA but not control RNA 
in the vast majority of mice as shown by bioluminescence imaging 
(BLD) (Fig. 2b). Tumour infiltrating leukocytes purified from tumours 
of B16-M30 RNA immunized mice strongly reacted against B16-M30 
(Fig. 2c). In tumours from neo-epitope-encoding RNA treated versus 
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Figure 1 | Cancer-associated mutations are frequently immunogenic and 
pre-dominantly recognized by CD4* T cells. a, Schematic describing 
mutation discovery and immunogenicity testing. b-d, Splenocytes of mice 
vaccinated with peptides and polyinosinic:polycytidylic acid (polyI:C) 

(b, B16F10, n = 5) or immunized with antigen-encoding RNA (c, CT26, n = 5; 
d, 4T1, n = 3) were tested for recognition of mutated peptides by ELISpot. 
Subsequent subtyping was performed via MHC II blockade or intracellular 
cytokine and CD4/CD8 surface staining. Pie charts represent the prevalence of 
non-immunogenic, MHC class I- or II-restricted mutated epitopes. b, Right, 
subtyping of mutation-specific T cells. ¢, Right, MHC restriction of neo- 
epitopes prioritized based on either good (0.1-2.1) or poor (>3.9) MHC I 
binding scores. 


control mice, CD4* as well as CD8* T-cell infiltrates were significantly 
more abundant, whereas MDSCs and FoxP3™ T cells were significantly 
reduced (Fig. 2d). To benchmark antitumour efficacy, we tested 
immunogenic B16F10 neo-epitopes and a non-mutated TRP2-derived 
immunodominant epitope’. Whereas with TRP2 RNA, the two class I 
and three of the mutated class II neo-epitopes conferred a modest non- 
significant antitumour activity, three class II neo-epitopes mediated a 
significant inhibition of tumour growth (Extended Data Fig. 2d). Taken 
together, these data establish that a significant proportion of class II 
neo-epitopes have antitumour vaccine potency. Establishing B16-M30 
as a novel rejection antigen in B16F10 our findings show that RNA 
encoding a single neo-epitope may give rise to functional T cells capable 
of targeting into the cancer lesion, controlling and even curing murine 
tumours. Our findings are in agreement with recent reports of the 
pivotal role of CD4* T-cell immunity in the control of cancer”*. 
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Figure 2 | Efficient tumour control and survival benefit in B16F10 
melanoma with an RNA vaccine encoding a single mutated CD4* T-cell 
epitope. a, Tumour growth (mean + s.e.m.) and survival (+CD4- or CD8- 
depleting antibodies) in untreated (control) or B16-M30 immunized C57BL/6 
mice (1 = 10) inoculated subcutaneously (s.c.) with B16F10. b, B6 albino mice 
(n = 8) developing lung metastases upon intravenous (i.v.) injection of 
B16F10-Luc were treated with B16-M30 or irrelevant RNA (control). Median 
tumour growth was determined by BLI as photons per second. ¢, Single-cell 
suspensions of B16F10 tumours of irrelevant (control) or B16-M30 RNA 
immunized mice (n = 4) were restimulated with B16-M30 or irrelevant peptide 
(vesicular stomatitis virus nucleoprotein, VSV-NP52_59) and tested by ELISpot 
(mean + s.e.m.). Data pooled from two experiments. d, Frequency of 
infiltrating cells in s.c. B16F10 tumours (n = 3) left untreated (control) or 
vaccinated with B16-M30 RNA. 


The vast majority of mutations are unique to the individual patient. 
Hence, mutanome vaccines need to be individually tailored? and 
rapidly manufactured on-demand. This challenge can be viably 
addressed by RNA vaccine technology (Fig. 3a). At present, GMP- 
grade RNA is release-ready within 3 weeks. On another note, though 
we achieved tumour eradication in mice with a single mutation, to 
combine several mutations would be preferable to address tumour 
heterogeneity and immune editing, which mediate clinical failure of 
vaccines in humans’®*"'. We sought to integrate our insights into a 
concept which we call ‘mutanome engineered RNA immunotherapy’ 
(MERIT) (Fig. 3a). 

To test this concept, we engineered RNA monotopes encoding four 
MHC class II (CT26-M03, CT26-M20, CT26-M27, CT26-M68) and 
one MHC class I (CT26-M19) restricted mutation from the CT26 
model (see Extended Data Table 2) and a synthetic RNA pentatope 
encoding all five neo-epitopes connected by 10mer non-immunogenic 
glycine/serine linkers (Fig. 3a). In naive BALB/c mice the quantity of 
IFN-y-producing T cells elicited by the pentatope was comparable 
(3 of 5) or even higher than that evoked by the respective monotope 
(Extended Data Fig. 3a). 

In BALB/c mice with CT26 luciferase-transfected (CT26-Luc) 
lung metastases vaccinated repeatedly with a mixture of two RNA 
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Figure 3 | RNA pentatope immunization confers disease control and 
survival benefit in murine tumours. a, Engineering of a poly-neo-epitope 
RNA. b, BALB/c mice (n = 10) developing lung metastases upon i.v. injection 
of CT26-Luc were treated with a mixture of two pentatopes or left untreated 
(control). The median tumour growth by BLI, survival data and lungs from 
treated animals are shown. c, Upper panel, CD3 stained lung tissue sections. 
Scale bars: 1,000 um (scan), 100 um (top), 50 um (bottom). Lower panel, 
proportional lymphocyte areas in lung tumour tissue of control (n = 6) or 
pentatope-treated (CD3: n = 14; CD4, CD8, FoxP3: n = 12) animals. Lower 
panel right, tumour area (mean + s.e.m.) in sections of control (n = 18) and 
pentatopel + 2-treated (n = 39) mice. 


pentatopes (3 MHC class I- and 7 class II-restricted epitopes, Extended 
Data Table 4) including the mutations tested in the previous experi- 
ment, tumour growth was significantly inhibited (Fig. 3b). At day 32 all 
mice in the RNA pentatope group were alive whereas 80% of the 
control mice had already died. Dissection of the antitumour 
activity of single RNA pentatopes in the CT26 model revealed that 
RNA pentatope 2 has a very strong antitumour activity, whereas pen- 
tatope 1 is modestly active (Extended Data Fig. 3b). A subsequent 
study confirmed the antitumour activity of pentatope 2, and showed 
its significant decrease upon CD40L blockade and complete loss by 
CD8* T-cell depletion (Extended Data Fig. 3c). Tumour load in the 
vaccinated as compared to untreated mice was significantly lower as 
shown by post mortem macroscopic (Fig. 3b), histological (Fig. 3c 
upper panel) and computerized image analysis (Fig. 3c lower panel 
right) of tissue sections. CD3* T-cell infiltrates in tumour lesions of 
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pentatope RNA vaccinated mice, in contrast to findings in controls, 
were significantly brisker as compared to the surrounding lung tissues 
(Fig. 3c lower panel left). Moreover, in independent experiments we 
found a significant increase of CD4* and CD8" T cells in tumour 
lesions of RNA pentatope immunized mice but a lower FoxP3/CD4 
ratio as compared to tumours of mice treated with irrelevant RNA 
(Fig. 3c lower panel, Extended Data Fig. 3d). 

Taken together our data indicate that with a poly-neo-epitope 
encoding RNA vaccine T cells against each single epitope are elicited. 
These T cells target tumour lesions, recognize their mutated targets, 
reshape the cellular composition of the tumour microenvironment 
and result in efficient tumour control in vivo. 

The current paradigm for selecting mutations for immunization is 
to employ MHC class I binding scores for enrichment of mutated 
epitope candidates'?"* which can elicit CD8~ responses and tumour 
rejection. Our findings indicate that MHC class II presented neo- 
epitopes may be of higher interest for a MERIT approach. In fact, a 
correlation analysis revealed that immunogenic mutations have a sig- 
nificantly better MHC class II binding score as compared to non- 
immunogenic ones (Fig. 4a). As most cancers lack MHC class II 
expression, effective recognition of neo-epitopes by CD4* T cells 
should depend on presentation of released tumour antigens by antigen 
presenting cells (APCs). This is most efficient for highly expressed 
antigens’. Thus, we implemented an algorithm combining good 
MHC class II binding with abundant expression of the mRNA encod- 
ing the neo-epitope based on confirmed mutated RNA sequencing 
reads normalized to the overall read count (NVRC: normalized variant 
read counts). To test the impact of predicted MHC class II binding 
affinity, we ranked CT26 mutanome data with this algorithm and 
selected the top ten mutations (“ME’ mutations in Extended Data 
Table 5) predicted to be the best MHC class II binders among the 
most abundant candidate epitopes (NVRC = 60). The control com- 
prised ten mutations with abundant expression only (‘E’ mutations in 
Extended Data Table 5). These neo-epitopes were used without any 
further pre-validation or immunogenicity testing to engineer two RNA 
pentatopes for each group (Py and P; pentatopes). In mice with 
established CT26-Luc lung tumours, Pye induced a much stronger 
T-cell response as compared to Py pentatopes (Fig. 4c). Analysis of 
immune responses proved the presence of multiple immunogenic 
MHC class II neo-epitopes in the Pye RNA pentatopes (Extended 
Data Fig. 4). Established lung metastases were completely rejected in 
almost all Pye mice whereas Py pentatopes were not able to control 
tumour growth (Fig. 4b). An independent study confirmed the strong 
antitumour activity of the pentatope P\yz and showed loss of the anti- 
tumour effect upon anti-CD40L and anti-CD8 treatments (Fig. 4d). 

Antigen-specific Tj, cells promote the cross-priming of tumour- 
specific cytotoxic T lymphocyte (CTL) responses by CD40 ligand- 
mediated activation of dendritic cells. If Ty cells recognize their 
antigen on the same APC (cross-)presenting an CTL epitope, a diver- 
sified CTL response may result’®. In support of this notion, we detected 
strong CD8* T-cell responses against gp70-AH1, a well characterized 
immunodominant CTL epitope of the endogenous murine leukaemia 
virus-related cell surface antigen in the blood and spleen of mice 
immunized with P,;, but not P; pentatopes (Fig. 4e). This indicates 
that cancer neo-epitope specific Ty; cells, in analogy to viral neo- 
antigen specific T cells, may exert their antitumour function by 
augmentation of CTL responses through epitope spreading. 

In summary, we show that MHC class II-restricted T-cell epitopes 
are more abundant than previously appreciated in the cancer muta- 
nome and can be targeted by customized RNA-based poly-neo- 
epitope vaccines with substantial therapeutic effect in mouse tumour 
models. A recent study reports that about 0.5% of mutations induce 
spontaneous CD4* T-cell immune responses in human tumours”. 
Even by the most conservative estimate, our findings indicate that 
the proportion of relevant neo-epitope vaccine targets recognized 
by CD4* T-cells is a full log higher than that reported to induce 
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Figure 4 | RNA pentatope vaccines with mutations selected for in silico 
predicted favourable MHC class II binding and abundant expression confer 
potent antitumour control. a, Comparison of median MHC II binding scores 
of immunogenic (Response) and non-immunogenic (No response) mutated 
27mers. b, Highly expressed mutations were selected with (‘ME’) or without 
(E’) considering MHC class II binding score. Ten mutations (two pentatopes) 
per category were used for vaccination of CT26-Luc tumour-bearing mice 

(n = 10). Tumour growth, area under the curve (AUC) at day 40 and ink- 
treated lungs are shown. c, Mice (n = 5) were analysed for T-cell responses 
against the RNA pentatopes via ELISpot (mean + s.e.m. subtracted by an 
irrelevant RNA control). d, CT26 tumour nodules per lung of untreated mice or 
mice (n = 10) injected with irrelevant or Pp (+CD8 depletion or CD40L 
blocking) RNA. e, T-cell responses against gp70423_431 (gp70-AH1) determined 
via ELISpot in blood (pooled from 5 mice at day 20) and spleen (n = 5). 
Background (medium control) subtracted mean + s.e.m. shown. f, Genomic, 
expressed and predictively presented (HLA-DRB1, IEDB rank <10) non- 
synonymous single nucleotide variations (nsSNVs) derived from human 
cancer samples (TCGA). SKCM, skin cutaneous melanoma; COAD, colon 
adenocarcinoma; BRCA, breast invasive carcinoma. 
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spontaneous CD4* T-cell responses. The reason might be that spon- 
taneous immunogenicity is not driven by antigenicity alone, but is 
a function of many factors including tumour biology and local 
immunosuppression. Both B16 (Extended Data Fig. 2e) and CT26 
are tumours with strong immunosuppressive properties, such that 
they are not able to induce protective immunity unless transfected 
with immunomodulators such as GM-CSF. As our study shows that 
vaccination with CD4* T-cell neo-epitopes identified by exome 
sequencing counteracts the immunosuppressive tumour micro- 
environment in these models resulting in rejection of established 
tumours, one key prediction from our work is that such ‘poorly 
immunogenic’ tumours can be successfully targeted through induction 
of mutation-specific CD4* T-cell responses. 

A simple explanation for this remarkable frequency of CD4* T-cell 
recognition of mutations may be less stringent length and sequence 
requirement for peptides binding to MHC class II molecules as com- 
pared to MHC class I epitopes increasing the likelihood that a given 
mutation is found within a presented peptide’®. 

The first evidence of spontaneous CD8* and CD4* T-cell responses 
directed against mutated gene-products in cancer patients was gener- 
ated in the 1990s’’”°. Recent publications have renewed enthusiasm 
for the enormous potential of mutation-specific T cells to confer anti- 
tumour activity in cancer patients’**'. As elegantly proven in a murine 
carcinogen- induced sarcoma model” there is evidence for a direct link 
between T-cell recognition of mutant neo-epitopes and clinical res- 
ponse to checkpoint blockade treatment'*”. 

To assess whether the principles unraveled in the mouse models for 
melanoma, colon and breast cancer are true in the human setting, we 
analysed mutation and RNA-Seq data in the same human cancer types 
provided by The Cancer Genome Atlas (TCGA). For all three human 
cancers we confirmed the abundance of mutations predicted to bind to 
MHC class II (Fig. 4f). 

The MERIT approach we presented here by integrating advances in 
the field of next-generation sequencing, computational immunology 
and synthetic genomics provides the technology for comprehensive 
exploitation of the neo-epitope target repertoire. The approach might 
be particularly useful to reshape the tumour microenvironment in 
patients who lack T-cell infiltration in their tumours and applicable 
as a standalone or combination therapy to increase the clinical success 
rate of checkpoint blockade therapy that depends on pre-existing 
immunity*”*°. Targeting multiple mutations at once may in theory 
pave the way to solve critical problems in current cancer drug develop- 
ment such as clonal heterogeneity and antigen escape’®”’. 

Based on the findings of this study and of our prior work, a first-in- 
concept trial in melanoma patients’’’* has been initiated and is 
recruiting (NCT02035956), confirming that ‘just in time’ production 
of a poly-neo-epitope mRNA cancer vaccine is in fact feasible. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Cell lines and mice. Female 8-12 weeks old C57BL/6, BALB/c mice (Janvier Labs) 
and C57BL/6BrdCrHsd-Tyr* mice (B6 albino, Harlan) were kept in accordance 
with federal policies on animal research at the University of Mainz. B16F10 mel- 
anoma cell line, CT26 colon carcinoma cell line and 4T1-luc2-tdtomato (4T1-Luc) 
cells were purchased in 2010, 2011 and 2011, respectively (ATCC CRL-6475 lot no. 
58078645, ATCC CRL-2638 lot no. 58494154, Caliper 125669 lot no. 101648). 
Firefly-luciferase-expressing CT26-Luc and B16F10-Luc cells were lentivirally 
transduced. Master and working cell banks were generated immediately upon 
receipt, of which third and fourth passages were used for tumour experiments. 
Cells were tested for mycoplasma every 3 months. Reauthentification of cells was 
not performed since receipt. 

Next-generation sequencing and data processing. As described previously’, 
exome capture from mouse tumour cells and tail tissue samples of BALB/c or 
C57BL/6 mice were sequenced in triplicate (except 4T1-Luc in duplicate). Oligo 
(dT)-based RNA sequencing libraries for gene expression profiling were prepared 
in triplicate. Libraries were sequenced on an Illumina HiSeq2000 to generate 50 
nucleotide single-end (B16F10) or 100 nucleotide paired-end (CT26, 4T1-Luc) 
reads, respectively. Gene expression values were determined by counting reads 
overlapping transcript exons and junctions, and normalizing to RPKM expression 
units (reads which map per kilobase of transcript length per million mapped 
reads). Mutation expression was determined by normalization of mutated RNA 
reads to the total mapped read counts multiplied by 100 million (normalized 
variant read counts, NVRC). 

Mutation selection, validation and prioritization. As described previously 
mutations were selected based on following criteria: (1) present in the respective 
tumour cell line sequencing triplicates and absent in the corresponding healthy 
tissue sample triplicates, (2) occur in a RefSeq transcript, (3) cause non-synonym- 
ous changes, and (4) occurrence in expressed genes of tumour cell lines (median 
RPKM across replicates). For validation, mutations were amplified from DNA 
from the respective cell lines of mice tail tissue and subjected to Sanger sequencing. 
DNA-derived mutations were classified as validated if confirmed by either Sanger 
sequencing or the RNA-Seq reads. No confirmation via Sanger sequencing and 
immunogenicity testing was performed for experiments in Fig. 4. For experiments 
shown in Fig. 1 mutated epitopes were prioritized according to their MHC class I 
binding predicted by the consensus method (version 2.5) of the Immune Epitope 
Database (http://www.iedb.org). Mutations shown in Fig. 4b-e were selected based 
on either their expression (NVRC) alone or together with their predicted MHC 
class II peptide binding capability (IEDB consensus method version 2.5). 
Retrospective analysis of MHC II binding prediction shown in Fig. 4a was deter- 
mined with IEDB consensus method version 2.12. For analysis of mutations in 
human tumours, DNA sequencing data of skin cutaneous melanoma (SKCM, 
n = 308), colon adenocarcinoma (COAD, n = 192) or breast invasive carcinoma 
(BRCA, n = 872) retrieved from The Cancer Genome Atlas (TCGA) (August 2014) 
was filtered to obtain genomic non-synonymous point mutations (nsSNVs). RNA- 
Seq data (TCGA) of tumour samples with identified genomic mutations was used 
to define expressed nsSNVs (STAR aligner, reference genome: GRCh37/hg19, max. 
mismatch ratio of 2%). The reads mapping to the reference genome were inter- 
sected with the UCSC gene model database. To predict MHC II binding of 
expressed neo-epitopes, seq2HLA” was employed to identify the patients’ 4-digit 
HLA class I] (HLA-DQA1, HLA-DQB1, HLA-DRB1) type. The IEDB consensus 
binding prediction (version 2.12) was used to predict MHC class II binding from a 
27mer peptide and the patients HLA-DRB1 alleles. As recommended from IEDB, 
neo-epitopes with a percentile rank below 10% were considered as binders. 
Synthetic RNA and synthetic peptides. Non-synonymous mutations were 
studied in the context of the respective 27mer amino acid epitope with the mutated 
amino acid in the centre (position 14). Mutated peptides were synthesized together 
with gp70-AH1 (gp70423-431) and control peptides (vesicular stomatitis virus 
nucleoprotein (VSV-NP352-s9), tyrosinase-related protein 2 (TRP21g0-18g) by JPT 
Peptide Technologies GmbH. Alternatively, sequences encoding mutated 27mer 
peptides were cloned into the pST1-Sp-MITD-2hBgUTR-A120 backbone’ featur- 
ing sequence elements for pharmacologically optimized synthetic RNA in terms of 
translation efficiency and MHC class I/II processing of epitopes either as mono- 
topes or fused to each other by sequences encoding 10 amino acid long glycine- 
serine linker in between resulting in pentatopes. Linearization of these plasmid 
constructs, in vitro translation (IVT) and purification are described in detail 
elsewhere’. 

Mouse models. For immunogenicity studies of mutated epitopes age-matched 
female C57BL/6 or BALB/c mice were vaccinated on day 0, 3, 7 and 14 (RNA 
immunization) or day 0 and 7 (peptide immunization). The read out was per- 
formed five to six days after the last immunization (see also Enzyme-linked 
ImmunoSpot and Flow cytometric analysis method sections). Vaccination was 
performed either by i.v. injection of 200 ul (20 ug per mutation for B16F10, 40 pg 
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per mutation for CT26) RNA complexed with cationic lipids (manuscript in 
preparation) or subcutaneous (s.c.) injection of 100g synthetic peptide and 
50 ug poly(I:C) formulated in PBS (200 pl total volume) into the lateral flank. 
Two mutations per mouse were tested (n = 5 for B16F10, n = 3 for CT26). For 
confirmation of immunogenic mutations and subtyping, mice were vaccinated 
against a single mutation (n = 5). 

For therapeutic tumour experiments C57BL/6 mice were inoculated s.c. with 
1X 10° B16F10 melanoma cells into the flank and randomly distributed into 
treatment groups. Tumour volume was measured unblinded with a caliper and 
calculated using the formula (A X B’)/2 (A as the largest and B the smallest 
diameter of the tumour). Tumour growth was documented as mean tumour size 
with standard error disregarding single distant outliers. 

In lung metastasis experiments 5 X 10° CT26-Luc or 2 X 10° CT26 cells were 
injected into the tail vein of BALB/c mice or 1.5 X 10° B16F10-Luc tumour cells 
into B6 albino mice. Tumour growth of luciferase transgenic cells was traced 
unblinded by bioluminescence imaging after ip. injection of an aqueous solution 
of D-luciferin (250 pl, 1.6 mg, BD Bioscience) on an IVIS Lumina (Caliper Life 
Sciences). Five minutes after injection emitted photons were quantified. In 
vivo bioluminescence in regions of interest (ROI) were quantified as total flux 
(photons s~ 1) (IVIS Living Image 4.0). Mice were randomized based on their total 
flux values (ANOVA-P method, Daniel’s XL Toolbox V6.53). CT26 lung tumour 
burden was quantified unblinded after tracheal ink (1:10 diluted in PBS) injection 
and fixation with Fekete’s solution (5 ml 70% ethanol, 0.5 ml formalin, and 0.25 ml 
glacial acetic acid). In therapeutic experiments mice were administered repeated 
doses of either monotope (40 1g), pentatope RNA (in total 40 1g) or equimolar 
amounts of irrelevant RNA. In some experiments repeated doses (200 [ig per mouse 
itp.) of CD8-depleting (clone YTS191, BioXcell), CD4-depleting (clone YTS169.1, 
BioXcell) or CD40L-blocking (clone MR1) antibodies were administered. The 
experimental group sizes were approved by the regulatory authorities for animal 
welfare after being defined to balance statistical power, feasibility and ethical aspects. 
Enzyme-linked ImmunoSpot (ELISpot). As previously described’, 5 x 10° sple- 
nocytes were cultured over night at 37 °C in anti-INF-y (10 1g ml | clone AN18, 
Mabtech) coated Multiscreen 96-well plates (Millipore) and cytokine secretion 
was detected with an anti-IFN-y antibody (1 pg ml 1, clone R4-6A2, Mabtech). 
For stimulation either 2 pg ml~' peptide was added or spleen cells were coincu- 
bated with 5 x 104 syngeneic bone-marrow- derived dendritic cells (BMDC) trans- 
fected with RNA. For analysis of tumour infiltrating lymphocytes, single-cell 
suspensions of lung metastasis were rested overnight to get rid of living tumour 
cells via plastic adherence. Viable cells were separated via density gradient cent- 
rifugation and added to the ELISpot plate. For analysis of T-cell responses in 
peripheral blood, PBMC were isolated via density gradient centrifugation, counted 
and restimulated by addition of peptide and syngeneic BMDC. Subtyping of T-cell 
responses was performed with an MHC class II blocking antibody (20 1g ml’, 
clone M5/114, BioXcell). All samples were tested in duplicates or triplicates. 
Flow cytometric analysis. In the presence of Brefeldin A (Sigma-Aldrich) 2 X 10° 
splenocytes were stimulated with 2 X 10° RNA-transfected BMDC or 2 pg ml~? 
peptide. Splenocytes treated with phorbol 12-myristate 13-acetate (PMA, 0.5 
ug ml’, Sigma-Aldrich) and Ionomycin (1 tg ml’, Sigma-Aldrich) served as a 
positive control. Cells were incubated for 5 h at 37 °C, stained for CD4* and CD8* 
cell surface markers, permeabilized and fixed using BD Cytofix/Cytoperm accord- 
ing to the manufacturer’s protocol. Thereafter cells were stained for INF-y, TNF-a 
and IL-2 cytokines (BD Biosciences). Cytokine secretion among CD4* or CD8* T 
cells in stimulated samples was compared to control samples (medium, irrelevant 
RNA or irrelevant peptide) in order to determine the responding T-cell subtype 
(n = 5). Tumour infiltrating leucocytes were prepared from subcutaneous B16F10 
tumours fifteen to twenty days after inoculation. Tumours were harvested and 
minced into pieces of 1-2 mm diameter. The resulting cell suspension was har- 
vested, filtered through a 70-\1m cell strainer, washed two times and stained for 
CD4, CD8, Gr-1 and CD11b surface marker. Intracellular FoxP3 staining was 
performed according to the manufacturer’s protocol (Mouse Foxp3 Buffer Set, 
BD). Samples were acquired on a BD FACSCanto II. 

Immune histochemistry. Lungs of CT26 tumour bearing mice were fixed over- 
night in 4% phosphate buffered formaldehyde solution (Carl Roth) and embedded 
in paraffin. 50-j1m consecutive sections (3 per mouse) were stained for CD3 (clone 
SP7, Abcam), CD4 (clone 1, catalogue no. 50134-M08H, Sino Biologinal) and 
FoxP3 (polyclonal, catalogue no. NB100-39002, Novus Biologicals) following detec- 
tion by a HRP-conjugated antibody (Poly-HRP-anti-rabbit IgG, ImmunoLogic) 
and the corresponding peroxidase substrate (Vector Nova Red, Vector 
Laboratories) and counterstained with hematoxylin. CD3*, CD4*, FoxP3* and 
tumour areas were captured on an Axio Scan.Z1 (Zeiss) and manually pre-defined 
tumour and lung regions were quantified via computerized image analysis software 
(Tissue Studio 3.6.1, Definiens). CD8* area was calculated by subtracting CD4 
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stained area from CD3°* area. For comparison of tumour areas between control and 
pentatopel + 2-treated animals, tumour-free sections were excluded. 
Immunofluorescence staining. 8-j1m sections of cryo-conserved organs were 
attached on Superfrost slides, dried overnight at room temperature and fixed in 
4% para-formaldehyde (PFA) for 10 min at room temperature in the dark. Sections 
were washed 3 times with PBS and blocked using PBS supplemented with 1% BSA, 
5% mouse serum, 5% rat serum and 0.02% Nonident for 1 h at room temperature in 
the dark. Fluorescent labelled antibodies (FoxP3, clone FJK-16 s, eBioscience; CD8, 
clone 53-6.7, BD; CD4, clone RM4-5, BD) were diluted in staining buffer (PBS 
supplemented with 1% BSA, 5% mouse serum and 0.02% Nonident) and sections 
were stained overnight at 4°C. After washing twice with washing buffer (PBS 
supplemented with 1% BSA and 0. 02% Nonident) and once with PBS, slides were 
stained for 3 min with Hoechst (Sigma), washed 3 times with PBS, once with 
distilled water and mounted using Mounting Medium Flouromount G 
(eBioscience). Immunofluorescence images were acquired using an epifluorescence 
microscope (ApoTome, Zeiss). Tumour, CD4, CD8 and FoxP3 stained areas were 
quantified within manually pre-defined tumour regions via computerized image 
analysis software (Tissue Studio 3.6.1., Definiens). The proportion of marker pos- 
itive cells in comparison to DAPI positive cells was calculated. 


Statistics. Means were compared by using Student’s t-test for hypothesis testing to 
compare individual treatment and corresponding control groups. In case of sig- 
nificantly different variances (F-test, alpha = 0.05) Welch’s correction was used. 
Mann-Whitney U test was applied if data sets failed the Pearson omnibus nor- 
mality test (alpha = 0.05). Tumour growth was compared by calculating the area 
under the tumour growth curve (AUC) for single mice. Statistical differences in 
medians between two groups were calculated with a nonparametric Mann- 
Whitney U test. Survival benefit was determined with the log-rank test. All ana- 
lyses were two-tailed (except Fig. 4c, e) and carried out using GraphPad Prism 
5.03. n.s.: P > 0.05, *P S 0.05, **P < 0.01, ***P < 0.001. Grubb’s test was used for 
identification of outliers (alpha = 0.05). No statistical methods were used to pre- 
determine sample size. 


29. Boegel, S. et a/. A catalog of HLA type, HLA expression, and neo-epitope 
candidates in human cancer cell lines. Oncolmmunology 3, e954893 ( 
2014). 

30. Kreiter, S. et al. Intranodal vaccination with naked antigen-encoding RNA elicits 
potent prophylactic and therapeutic antitumoral immunity. Cancer Res. 70, 
9031-9040 (2010). 
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B16-M27: REGVELCPGNKYEMRRHGTTHSLVIHD 
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Ga MHC class | restricted 
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Extended Data Figure 1 | Non synonymous cancer-associated mutations 
are frequently immunogenic and pre-dominantly recognized by CD4* T 
cells. T-cell responses obtained by vaccinating C57BL/6 mice with antigen- 
encoding RNA in the B16F10 tumour model (n = 5). Left, prevalence of non- 
immunogenic, MHC-class-I- or class-I]-restricted mutated epitopes. Right, 
detection and typing of mutation-specific T cells (individual epitopes shown in 
Extended Data Table 1). 
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Extended Data Figure 2 | Mutant epitope-specific T cells induced by RNA 
vaccination control tumour growth. a, Splenocytes of mice (n = 5) vaccinated 
with B16-M30 RNA were tested by ELISpot for recognition of mutated peptides 
as compared to the corresponding wild-type (B16-WT30) sequence. Right, 
testing of truncated variants of B16-M30 (mean + s.e.m.). b, Mean + s.e.m. 
tumour growth (left) and survival (right) of C57BL/6 mice (n = 10) inoculated 
s.c. with B16F10 and left untreated (control) or injected i-v. with irrelevant 
RNA. c, Lungs of B16F10-Luc tumour bearing mice shown in Fig. 2b (day 27 
after tumour inoculation). d, Therapeutic antitumour activity against B16F10 


tumours in mice (B16-M27, Trp2 n = 8; B16-M30 n = 7; others n = 10) 
conferred by immunization with epitopes encoding immunogenic B16F10 
mutations or an immunodominant wild type Trp2 epitope®. The area under the 
tumour growth curve at day 30 after tumour inoculation was normalized to 
untreated control mice and depicted as mean = s.e.m. Red and black columns 
represent mutations recognized by CD8* or CD4* T cells, respectively. 

e, Spontaneous immune responses in splenocytes of irrelevant RNA treated 
B16F10 tumour bearing C57BL/6 mice (n = 3) were tested by ELISpot for 
recognition of peptides (mean + s.e.m.). 
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Extended Data Figure 3 | Mechanism of antitumour activity of mutation 
specific poly-epitope vaccines in CT26 tumour-bearing mice. a, BALB/c 
mice (m = 5) were vaccinated either with pentatope (35 |1g) or the corresponding 
mixture of five RNA monotopes (7 [1g each). T-cell responses in peptide- 
stimulated splenocytes of mice were measured ex vivo via ELISpot (medium 
control subtracted mean + s.e.m.). b, c, BALB/c mice (n = 10) were inoculated 
iv. with CT26 tumour cells and left untreated or injected with irrelevant, CT26- 
M19 or pentatopel or 2 RNA in absence (b) or presence of a CD8 T cell 
depleting antibody or a CD40L blocking antibody (c). Mean = s.e.m. of tumour 


Irrelevant Pentatope2 
RNA 


Irrelevant Pentatope2 
RNA 


nodules per lung are shown. d, Immunofluorescence analyses of tumour- 
infiltrating lymphocytes in pentatope2-vaccinated mice. Upper panel, lung 
tumour tissue stained for CD4 and CD8 or CD4 and FoxP3. Scale bar, 50 tum. 
Lower panel left, proportion of infiltrating cells in sections of irrelevant (CD4: 
n= 13; CD8 = 9; FoxP3: n = 13) or pentatope (CD4: n = 17; CD8: n = 6; 


FoxP3: n= 10) RNA- 


treated animals. Lower panel right, tumour area in 


sections of control (n = 22) and pentatope2-treated (n = 20) animals 


(mean + s.e.m.). 
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Extended Data Figure 4 | Immunogenicity testing of Pp pentatope- 
encoded mutations. Splenocytes of Pyyjz RNA vaccinated BALB/c mice (n = 6) 
were tested ex vivo for recognition of peptides representing the mutated 27mer 
sequences represented in P\yg pentatopes with or without addition of an MHC 
class II-blocking antibody. Mean + s.e.m. of background (medium control) 
subtracted responses are shown. 
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Extended Data Table 1 | Immunogenic B16F10 mutations 


Response after 


A io Substitution Reactive T Mel seore vaccination with 
Mutation Gene Mutated sequence used for vaccination (best 
(WT, AA#, Mut) cell subtype dicti 
prediction) beptide RNA 
B16-M05 Eef2 FVVKAYLPVNESFAFTADLRSNTGGOA G795A cD4+ 14 . 


B16-M12 Gnas TPPPEEAMPFEFNGPAQGDHSQPPLOV S111G CcD4* 


= 
ND 
bad 


B16-M20 Tubb3  FRRKAFLHWYTGEAMDEMEFTEAESNM G402A cD4* 


= 
o 
bad 


B16-M22 Asfib = PKPDFSQLORNILPSNPRVTRFHINWD A141P cD4* 


= 
N 
bad 


B16-M25 Plod1 STANYNTSHLNNDVWOIFENPVDWKEK F530V cD4* 


j=) 
ua 
bad 
x 


B16-M28 Ppp1r?7 = NIEGIDKLTQLKKPFLVNNKINKIENI L170P cD4* 


wo 
ND 
bad 
bad 


B16-M30 Kifi8b § PSKPSFQEFVDWENVSPELNSTDOPFL K739N cD4* 


= 
ND 
bad 
x 


B16-M36 Tm9sf3  CGTAFFINFIAIYHHASRAIPFGIMVA Y382H CD4* 


j=) 
NO 
bad 


B16-M45 Mkrn1 ECRITSNFVIPSEYWVEEKEEKQKLIQ N346Y cD4* 


= 
a 
x< 


B16-M47 Rpli3a GRGHLLGRLAAIVGKQVLLGRKVVVVR A24G cD4+ 


i=) 
a 
x 


B16-M50 Sema3b GFSQPLRRLVLHVVSAAQAERLARAEE L663V cb4* 


Ld 
© 
x 
x 


B16F10 mutations determined to be immunogenic upon peptide or RNA immunization (see Fig. 1 and Extended Data Fig. 1). WT, wild type; AA#, position of mutated amino acid; Mut, mutation. 
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Extended Data Table 2 | Immunogenic CT26 mutations 


Substitution Reactive T cell MHC | score 


Mutation Gene Mutated sequence used for vaccination (WT, AA#, Mut) subtype (best prediction) 


CT26-M12 Gpc1 YRGANLHLEETLAGFWARLLERLFKQL E165G CD8* 


= 
o 


CT26-M19 Tmem87a  QAIVRGCSMPGPWRSGRLLVSRRWSVE G63R CbD8+ 


i 


CT26-M24 Cxcr7 MKAF IFKY SAKTGFTKLIDASRVSETE L340F cD4+ 


= 
foe} 


CT26-M27 Agxt2I2 EHIHRAGGLFVADAIQVGFGRIGKHFW E247A cbD4* 


id 
iN) 


CT26-M37 Dhx35 EVIQTSKYYMRDVIAIESAWLLELAPH T6461 cD4+ 


S 
an 


CT26-M42 Deptor SHDSRKSTSFMSVNP SKELKIVSAVRR $253N CcD4+ 


oS 
wo 


CT26-M55 Dkk2 EGDPCLRS SDCIDEFCCARHFWIKICK G192E cbD4* 


Al 
= 


CT26-M68 Steap2 VISIPSVSNALNWKEFSF IQSTLGYVA R388K CD4* 


@ 
fo) 


CT26-M78 Nbea PAPRAVLTGHDHEIVCVSVCAELGLVI V576l cD4* 


2 
wo 


CT26-M91 Zc3h14 NCKYDTKCTKADCLF THMSRRASILTP P497L cD4* 


@ 
fo) 


CT26 mutations determined to be immunogenic upon RNA immunization (see Fig. 1). WT, wild type; AA#, position of mutated amino acid; Mut, mutation. 
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Extended Data Table 3 | Immunogenic 4T1 mutations 


Substitution Reactive T cell 


Mutation Gene Mutated sequence used for vaccination (WT, AA#, Mut) eubiyps 


4T1-M3 Polr2a LAAQSLGEPATQITLNTFHYAGVSAKN M1102! 


i) 
o] 
BK 


471-M14 


N 
= 


AHIRGAKHQKVVTLHTKLGKPIPSTEP K411T 


i) 
1s] 
BK 


AT1-M17 Malt1 FLKDRLLEDKKIAVLLDEVAEDMGKCH T534A 


i) 
1s] 
BK 
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(@) 
7] 
BK 
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2) 
0 
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4T1-M30 Enho MGAAISQGAIIAIVCNGLVGFLL L101 


ie) 
1] 
BK 


47T1-M32 Rragd SHRSCSHQTSAPSPKALAHNGTPRNAI L268P 


i) 
7] 
BK 


ie) 
7] 
BK 


4T1-M39 lIkap RKGEREEMQDAHVSLNDITQECNPPSS 1278 


AT1 mutations determined to be immunogenic upon RNA immunization (see Fig. 1). WT, wild type; AA#, position of mutated amino acid; Mut, mutation. 
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Extended Data Table 4 | CT26 mutated epitopes encoded in pentatope 1+2 


Pentatope Mutation Gene Mutated sequence used for Substitution Reactive T cell 
vaccination (WT, AA#, Mut) subtype 
cc 
1 CT26-M39 Als2 GYISRVTAGKDSYIALVDKNIMGYIAS L675l CDs* 
ieee i ae en a) 
1 CT26-M55 Dkk2 EGDPCLRSSDCIDEFCCARHFWTKICK G192E cCD4* 
i ee se Se ere aes 
2 CT26-M20 Sic4a3 PLLPFYPPDEALEIGLELNSSALPPTE 373! cb4+ 
DS ieee eee ea 
2 CT26-M03 Slc20a1 DKPLRRNNSYTSY IMAICGMPLDSFRA T4251 cD4+ 
a ees 
2 CT26-M27 Agxt2|2 EHIHRAGGLFVADAIQVGFGRIGKHFW E247A CD4* 


Ten immunogenic mutated epitopes were used for generation of two pentatopes used for therapeutic vaccination shown in Fig. 3b. WT, wild type; AA#, position of mutated amino acid; Mut, mutation. 
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Extended Data Table 5 | In silico prediction of CT26 mutations with abundant expression and favourable MHC class II binding properties 


Substitution Expression MHC II score 


Mutation Gene Mutated sequence used for vaccination (WT, AA#, Mut) (NVRC) (best prediciton) 


CT26-E2 Cd34 PQTSPTGILPTTSNSISTSEMTWKSSL D120N 1150,85 23,76 


CT26-E4 Tmbim6 SALGSLALMIWLMTTPHSHETEQKRLG A73T 825,51 2,96 


CT26-E16 Eif4g2 KLCLELLNVGVESNLILKGVILLIVDK K108N 327,79 20,99 


CT26-E18 Fn1 YTVSVVALHDDMENQPLIGIQSTAIPA S1710N 303,62 17,41 


CT26-E20 Uchl3 KFMERDPDELRFNTIALSAA A224T 301,78 9,75 


CT26-ME2 UbqIn1 DTLSAMSNPRAMQVLLOIQQGLOTLAT A62V 84,08 0,24 


CT26-ME4 Trip12 WKGGPVKIDPLALMQAIERYLVVRGYG V1328M 83,09 0,49 


CT26-ME6 Cad SDPRAAYFROQAENDMY IRMALLATVLG G2139D 152,86 0,55 


CT26-ME8 Ddx27 ITTCLAVGGLDVKFQEAALRAAPDILI $297F 61,82 0,62 


CT26-ME10 Lin7c GEVPPQKLOALQRALOQSEFCNAVREVY V41A 71,24 1,09 


CT26 mutations selected for high expression with (ME) or without (E) consideration of the MHC II percentile rank (IEDB consensus version 2.5). WT, wild type; AA#, position of mutated amino acid; Mut, mutation. 
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TP53 loss creates therapeutic vulnerability 


in colorectal cancer 


Yunhua Liu’, Xinna Zhang”, Cecil Han', Guohui Wan’, Xingxu Huang", Cristina Ivan?*, Dahai Jiang”, 
Cristian Rodriguez-Aguayo*”, Gabriel Lopez-Berestein®’, Pulivarthi H. Rao®, Dipen M. Maru’, Andreas Pahl®, Xiaoming He’, 


Anil K. Sood!7, Lee M. Ellis!°, Jan Anderl® & Xiongbin Lu’? 


TP53, a well-known tumour suppressor gene that encodes p53, is 
frequently inactivated by mutation or deletion in most human 
tumours”. A tremendous effort has been made to restore p53 
activity in cancer therapies*’. However, no effective p53-based 
therapy has been successfully translated into clinical cancer treat- 
ment owing to the complexity of p53 signalling. Here we dem- 
onstrate that genomic deletion of TP53 frequently encompasses 
essential neighbouring genes, rendering cancer cells with hemizy- 
gous TP53 deletion vulnerable to further suppression of such 
genes. POLR2A is identified as such a gene that is almost always 
co-deleted with TP53 in human cancers. It encodes the largest and 
catalytic subunit of the RNA polymerase II complex, which is spe- 
cifically inhibited by a-amanitin*’. Our analysis of The Cancer 
Genome Atlas (TCGA) and Cancer Cell Line Encyclopedia 
(CCLE) databases reveals that POLR2A expression levels are 
tightly correlated with its gene copy numbers in human colorectal 
cancer. Suppression of POLR2A with a-amanitin or small inter- 
fering RNAs selectively inhibits the proliferation, survival and 
tumorigenic potential of colorectal cancer cells with hemizygous 
TP53 loss in a p53-independent manner. Previous clinical applica- 
tions of g-amanitin have been limited owing to its liver toxicity”. 
However, we found that a-amanitin-based antibody-drug conju- 
gates are highly effective therapeutic agents with reduced toxicity". 
Here we show that low doses of a-amanitin-conjugated anti-epi- 
thelial cell adhesion molecule (EpCAM) antibody lead to complete 
tumour regression in mouse models of human colorectal cancer 
with hemizygous deletion of POLR2A. We anticipate that inhib- 
iting POLR2A will be a new therapeutic approach for human can- 
cers containing such common genomic alterations. 

Genomic deletion of a tumour suppressor gene often encompasses 
several neighbouring genes that may not contribute to cancer develop- 
ment, but are essential for cell proliferation and survival’. This partial 
loss of essential genes has been postulated to render cancer cells highly 
vulnerable to further inhibition of those genes'*'*. Analysis of TCGA 
revealed that hemizygous deletion of the TP53 gene occurs frequently 
in human cancers (Fig. la). We identified POLR2A as an essential gene 
in the proximity of TP53 (Fig. 1b). Concomitant deletion of POLR2A 
occurs in virtually all the human colorectal tumours containing hemi- 
zygous TP53 deletion (Fig. 1c). 

Among the 12 subunits in the human RNA polymerase II complex, 
POLR2A encodes the largest subunit that is indispensable for the poly- 
merase activity in messenger RNA synthesis. Inhibiting POLR2A with 
a specific inhibitor, «-amanitin, causes extensive cell death, and homo- 
zygous deletion of POLR2A is lethal in human cells”**. We found that 
104 (53%) out of 195 colorectal cancer (CRC) cases bear hemizygous 
loss of the 17p13 region, resulting in concomitant deletion of TP53 and 


POLRZ2A (Fig. 1c). However, no homozygous deletion of POLR2A was 
observed, consistent with the notion that POLR2A is essential for cell 
survival. Analysis of TCGA and CCLE databases revealed that express- 
ion of POLR2A is tightly correlated with its gene copy number 
(Fig. 1d). This positive correlation was also validated in 20 pairs of 
matched normal and CRC tissue samples and human CRC tissue 
microarray (Fig. le and Extended Data Fig. 1). POLR2A'°* (hemizy- 
gous loss) cell lines expressed POLR2A proteins at significantly lower 
levels than POLR2A™™ cell lines (Fig. 1f-h). Unlike POLR2A, p53 
levels are determined by post-transcriptional and post-translational 
events'®. Despite a correlation between TP53 copy number and 
mRNA expression, p53 protein levels are not associated with its gene 
copy numbers in human CRC (Extended Data Fig. 2 and Fig. 1h). 

To assess the sensitivity of cells to POLR2A inhibition, a panel of 
POLR2A"*“"" (HCT116, SW480) and POLR2A'** (SW837, SNU283) 
cells were treated with «-amanitin. Treatment of o%-amanitin at high 
concentrations (=1 jig ml) caused complete cell death in all four cell 
lines. However, at concentrations ranging from 0 to 1.0pgml ', 
a-amanitin inhibition had significantly higher levels of cell-killing 
effect on the POLR2A’° cells than on the POLR2A"™™* cells 
(Fig. 2a, b). The half-maximum inhibitory concentration (IC59) was 
~1.0 1g ml | for the POLR2A"™"™ cells, which was tenfold greater 
than that of the POLR2A°® cells. By contrast, the POLR2A!°® cells did 
not show any greater sensitivity to the treatment of actinomycin D, a 
nonspecific transcription inhibitor (Extended Data Fig. 3a). In direct 
competition assays, the POLR2A™™ cells (HCT116, SW480) stably 
expressing POLR2A short hairpin RNAs (shRNAs) only had modestly 
reduced proliferation, in comparison with that of the corresponding 
cells expressing control shRNAs (Extended Data Fig. 3b, c). However, 
silencing POLR2A in the POLR2A"°® cells (SNU283, SW837) led to 
markedly reduced proliferation. We generated HCT116 and SNU283 
cell lines stably expressing doxycycline (Dox)-inducible POLR2A 
shRNAs (Extended Data Fig. 3d). Despite significant knockdown of 
POLR2A, HCT116 cells continued to proliferate, whereas SNU283 
cells exhibited severe G1 cell cycle arrest and apoptosis (Fig. 2c, d 
and Extended Data Fig. 3e-g). Approximately 50% of the decrease 
in POLR2A expression (30-100 ng ml! of Dox) remarkably reduced 
the proliferation of SNU283 cells, but only had a modest effect on 
HCT116 cells (Fig. 2d). Results of rescue experiments demonstrated 
that gradual re-expression of exogenous POLR2A in SNU283 and 
SW837 cells restored their resistance to &-amanitin up to a level com- 
parable to that of the POLR2AR™™! cells (Fig. 2e, f and Extended 
Data Fig. 4). 

To exclude genetic difference across cell lines, we used the CRISPR 
(clustered regularly interspaced short palindromic repeat)/Cas9 sys- 
tem to generate an isogenic HCT116 cell line with hemizygous loss of 
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Figure 1 | Expression of POLR2A, but not TP53, is correlated with the gene 
copy number. a, Frequencies of hemizygous deletion of TP53 in human 
cancers. AML, acute myeloid leukaemia; DLBC, diffuse large B-cell lymphoma; 
GBM, glioblastoma. b, Schematic diagram of genes adjacent to TP53 in human 
genome. c, Concomitant deletion of POLR2A in human CRCs containing 
hemizygous loss of TP53. d, Scatterplots of POLR2A copy number versus 


POLR2A (refs 17, 18) (Extended Data Fig. 5a—c). POLR2A expression 
was significantly reduced in the POLR2A°* HCT116 cells (Extended 
Data Fig. 5d). The POLR2A’° and parental HCT116 cells exhibited 
similar proliferation rates (Extended Data Fig. 5e), indicating that one 
allele of POLR2A is sufficient to maintain cell proliferation. However, 
hemizygous deletion of POLR2A markedly sensitized HCT116 cells to 
o-amanitin treatment with an ICs value of ~0.1 p1g ml’, which is 
eightfold lower than that of parental HCT116 cells (Fig. 2g, h). As a 
control, no substantial difference on their sensitivity to actinomycin D 
was observed (Extended Data Fig. 5f). Knockdown of POLR2A 
resulted in profoundly reduced proliferation in the POLR2A’° cells, 
but not in the isogenic POLR2A"""™ cells (Extended Data Fig. 5g). In 
the isogenic pair of HCT116 cell lines expressing Dox-inducible 
POLR2A shRNAs, a low-dose of Dox (100 ng ml‘) was sufficient to 
remarkably kill the POLR2A'°* cells, while high doses of Dox only had 
minimal effects on the POLR2A"™"™' cells (Fig. 2i, j and Extended 
Data Fig. 5h). 

Blockage of RNA polymerase may lead to p53 accumulation and 
activation’’. To examine the effects of p53, we recapitulated the con- 
comitant deletion of TP53 and POLR2A in HCT116 and xenografted 
human primary CRC (xhCRC) cell lines (Fig. 3a and Extended Data 
Fig. 6a-c). The xhCRC cell line (TP53*’* POLR2A*’*), established 
from a freshly isolated xenografted human colon tumour, demon- 
strated enhanced tumorigenicity in vivo”®. Except for slightly increased 
cell proliferation, no significant changes in their sensitivity to o-ama- 
nitin were observed in either xhCRC or HCT116 cells with hemizygous 
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mRNA expression in TCGA and CCLE databases. Pearson correlation 
coefficients (r) and P values are shown. e, POLR2A protein levels in matched 
normal and CRC tissue samples. Error bars denote s.d. f, g, Copy numbers 
(f) and relative mRNA expression levels (g) of POLR2A in human CRC cell 
lines. Data are mean and s.d. of three independent experiments. h, Protein 
levels of POLR2A and p53 in human CRC cell lines. 


deletion of TP53. By contrast, hemizygous loss of POLR2A markedly 
sensitized these cells to o-amanitin treatment regardless of their TP53 
status (Fig. 3b and Extended Data Fig. 6d-g). The mRNA synthesis 
activity of the RNA polymerase II is essential to any type of cells 
including therapy-resistant tumour cells. We examined the drug sens- 
itivity of POLR2A"™"™ and POLR2A'°® cells to three major chemo- 
therapy drugs for CRC: 5-fluorouracil (5-FU), oxaliplatin and 7-ethyl- 
10-hydroxy-camptothecin (SN-38). Inhibition of POLR2A by a-ama- 
nitin significantly enhanced the cell-killing effects of all three drugs in 
the POLR2A'* xhCRC cells, but had no notable effects on the 
POLR2A™""™' cells (Fig. 3c), suggesting therapeutic vulnerability of 
POLR2A"°* colorectal tumours. Free o-amanitin is toxic to liver owing 
to its interaction with OATP1B3, a transporter exclusively expressed 
on the membrane of hepatocytes’®. However, when conjugated with 
specific antibodies, «-amanitin is no longer a substrate for OATP1B3 
(refs 10, 11, 21). This strategy overcomes the toxicity of x-amanitin for 
clinical applications. We used o-amanitin conjugated to a monoclonal 
antibody (HEA125) against EpCAM, a cancer antigen overexpressed 
in adenocarcinomas’’”*. The ama-HEA125 conjugate selectively 
killed the POLR2A'°S xhCRC and HCT116 cancer cells in a p53- 
independent manner, and reduced the effective doses of o-amanitin 
by at least 10,000-fold (IC5) ~ 0.01 ngml~') in vitro (Fig. 3d and 
Extended Data Fig. 6h). 

To test the anti-tumour effect of POLR2A inhibition in vivo, 
HCT116 and SNU283 cells expressing Dox-inducible POLR2A 
shRNA were injected subcutaneously into non-obese diabetic/severe 
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Figure 2 | POLR2A’°* cells are highly sensitive to 
the POLR2A inhibition. a, b, POLR2A'*™ cells 
(SW837, SNU283) are significantly more sensitive 
to &-amanitin treatment than POLR2A™™"™' cells 
(HCT116, SW480). Crystal violet staining of cells 
(a) and quantification analyses (b) are shown. 

c, Dox-induced suppression of POLR2A inhibited 
the proliferation of SNU283 cells, but not of 
HCT116 cells. shNT denotes non-targeting control 
shRNA. d, Correlation between POLR2A mRNA 
expression and cell proliferation in HCT116 and 
SNU283 cells expressing Dox-inducible POLR2A 
shRNA. e, f, Survival curves of SW837 (e) and 
SNU283 (f) cells treated with o-amanitin after 
transfection with increasing amounts of PORL2A 
expression vector DNA. g, h, POLR2A°S HCT116 
cells are significantly more sensitive to o-amanitin 
treatment than the parental POLR2A"™™"™ cells. 
Crystal violet staining of cells (g) and 
quantification analyses (h) are shown. 

i, j, Correlation between POLR2A mRNA 
expression and cell | proliferation (i) or apoptosis 
(j) in POLR2A""™™ and POLR2A°S HCT116 
cells. Data are mean and s.d. of three independent 
experiments. 


Figure 3 | The sensitivity of POLR2A™ cells to 
POLR2A inhibition is independent of p53. 

a, Protein levels of POLR2A, p53 and EpCAM ina 
panel of isogenic human xhCRC cell lines. b, Cell 
proliferation of isogenic xhCRC cells treated with 
a.-amanitin. c, Sensitivity of isogenic xhCRC cells to 
5-FU, oxaliplatin (Oxa) or SN-38 treatment 
combined with or without o%-amanitin (Ama). 

d, Cell proliferation of isogenic xhCRC cells treated 
with ama—HEA125. Data are mean and s.d. of three 
independent experiments. 
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Figure 4 | Suppression of POLR2A selectively inhibits the POLR2A°* 
tumour growth. a, b, Gross tumour images (a) and growth curves (b) of 
xenograft tumours derived from subcutaneously implanted HCT116 or 
SNU283 cells expressing control (ctrl) or Dox-inducible POLR2A shRNA. KD, 
knockdown. n = 5 mice per group. Error bars denote s.e.m. c-e, Tumour 
growth curves (c, **P < 0.01, error bars denote s.e.m.), gross tumour images 
(d) and weights (e, error bars denote s.d.) of xenograft tumours derived from 
orthotopically implanted POLR2A"*""*' and POLR2A°S HCT116 cells 


combined immunodeficiency (NOD/SCID) mice. After initial tumour 
establishment, administration of Dox (1.0 1g ml yin drinking water 
suppressed POLR2A expression and consequently inhibited the 
growth of SNU283-derived tumours (Fig. 4a, b and Extended Data 
Fig. 7a). However, no substantial differences were observed between 
control and POLR2A-knockdown HCT116-derived tumours. 
Histopathological analyses demonstrated that POLR2A-knockdown 
SNU283 tumours had significantly reduced cell proliferation, but more 
apoptotic cells, as compared with the corresponding control tumours 
(Extended Data Fig. 7b, c). By contrast, no significant changes were 
observed in the control or POLR2A-knockdown HCT116 tumours. 
However, hemizygous deletion of POLR2A sensitized HCT116-derived 
tumours to Dox treatment (Extended Data Fig. 7d, e). Next, we used an 
orthotopic tumour model by injecting POLR2A"™"" and POLR2A'°S 
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expressing Dox-inducible control or POLR2A shRNA (shNT or shRNA, 
respectively). n = 5 mice per group. f-i, Representative bioluminescent images 
(f, h) and tumour growth curves (g, i) of orthotopic xenograft tumours derived 
from POLR2A"™*! and POLR2A° HCT116 (£, g) or xhCRC cells (h, i) that 
received dual intraperitoneal injections of HEA125 antibody or ama~HEA125 
antibody-drug conjugate (3, 10, 30 and 90 pg kg '). n = 10 mice per group. 
Error bars denote s.e.m. 


HCT116 cells into the caecal wall of NOD/SCID mice. In vivo tumour 
imaging demonstrated that Dox-induced POLR2A inhibition led to a 
significant decrease in tumour growth kinetics in the POLR2A'°S 
tumours, but not in the control POLR2A™™ tumours (Fig. 4c-e 
and Extended Data Fig. 7f). To test the efficacy of POLR2A 
silencing in vivo further, we used a nanoliposomal delivery platform, 
DOPC (1,2-dioleoyl-sn-glycero-3-phosphatidylcholine), for systemic 
delivery of POLR2A siRNAs” (Extended Data Fig. 8a, b). After four 
weeks of systemic therapy, compared with control short interfering 
RNA (siRNA) and DOPC treatment, mice in the 125ugkg' of 
POLR2A-siRNA-DOPC treatment groups had pronounced growth 
reduction of POLR2A‘° tumours, while POLR2A"™"*' tumours only 
had significant growth inhibition even at the dose of 1,000 ngkg ' 
(Extended Data Fig. 8c-h). 
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Next, we investigated the anti-tumour activity of ama-HEA125 in 
orthotopic tumour models established by isogenic pairs of xhCRC and 
HCT116 cells (TP53/POLR2A""" and TP53/POLR2A'°*) (Fig. 4f-i 
and Extended Data Fig. 9a-c). In a dose-escalation experiment, 
ama-HEA125 was administered to the tumour-bearing mice as dual 
intraperitoneal injections at the dose of 3, 10, 30 or 90 lg with respect 
to %-amanitin per kilograms of mouse body weight. In the mice bear- 
ing POLR2A™ tumours, control HEA125-treated mice showed 
continuous tumour growth within 25 days after antibody injection. 
The treatment of ama~HEA125 showed strong inhibition on tumour 
growth even at the lowest dose of 3ugkg '. All the POLR2A'°S 
tumours responded to ama—HEA125 treatment, and tumour volume 
regressed markedly (Fig. 4f-i). Complete tumour regression was 
observed in 10 out of 10 (90 pg kg” '), 8 out of 10 (30 pg kg” ') and 6 
out of 10 (10ugkg ') mice bearing POLR2A'S HCT116 tumours 
25 days after the first administration of ama-~HEA125 (Fig. 4f, g). By 
contrast, significant tumour inhibition was observed in the mice bear- 
ing POLR2A"™"™* tumours only at the highest dose of 90 ug kg” ', but 
not at the doses of 3-30 jigkg”*. Similar results were observed in the 
mice bearing xhCRC-derived tumours (Fig. 4h, i). Consistent with our 
previous study”, treatment ofama—HEA125 at the tested doses had no 
notable toxicity in vivo. Analysis of body weights and blood liver 
enzymes did not reveal any substantial differences between the ama- 
HEA125-treated group and the control HEA125-treated group 
(Extended Data Fig. 9d, e), suggesting negligible systemic toxicity of 
the ama-HEA125 conjugate. In addition, we also tested the anti- 
tumour activity of ama—~HEA125 in orthotopic tumours derived from 
the POLR2A™“"™ $W480 cells and the POLR2A'°* SW837 cells 
(Extended Data Fig. 10). Treatment of ama-HEA125 at the 10 ugkg 
dose was sufficient to inhibit tumour growth in all the POLR2A°* 
tumours, while POLR2A"™™ (SW480) or POLR2A-restored (SW837 
expressing exogenous POLR2A) tumours were only significantly 
inhibited by ama-HEA125 at the dose of 90 ugkg '. 

Much effort has been directed towards restoring p53 activity in 
human cancer cells**”**. Instead of pharmacological intervention of 
p53 signalling, the principle of collateral vulnerability to POLR2A 
inhibition provides a novel strategy for cancer therapy. Our preclinical 
studies with «-amanitin demonstrate therapeutic efficacy of the strat- 
egy for treating CRC. Given that hemizygous loss of TP53 is a frequent 
genomic event in many human cancers, the therapeutic strategy 
described here may have broad applications. While clinical applications 
of «-amanitin are limited by its liver toxicity, conjugation of «-amanitin 
to antibodies against tumour-specific biomarkers minimizes its sys- 
temic toxicity and, moreover, increases the efficacy of treatment. The 
identification of new pharmacological agents that target POLR2A 
should significantly impact personalized cancer therapies. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Cell culture, antibodies and western blot analysis. HCT116, SW480, SW837, 
HT29, DLD1 and HT29 cell lines were obtained from the American Type Culture 
Collection and cultured under standard conditions specified by the manufacturer. 
SNU283 and SNU1197 cell lines were obtained from the Korean Cell Line Bank 
and cultured in RMPI 1640 medium supplemented with 10% FBS and 2 mmol1"' 
L-glutamine. Isolation, culture and maintenance of xhCRC cells were performed as 
previously described”°. In brief, patient-derived xenografts were collected under 
sterile conditions and mechanically dissociated, followed by 30 min of incubation 
in collagenase II at 37°C. The specimen was filtered through a sterile 100-j1m 
strainer. Red blood cells were eliminated with a hypo-osmotic red blood cell lysis 
buffer (eBioscience). Mouse cells in xenografted human CRC specimens were 
removed by negative selection using a mouse MHC class I molecule H-2K anti- 
body followed by use of a magnetic bead purification kit (Miltenyi). Fibroblasts 
were removed by negative selection using a MACS magnetic bead separation kit 
(Miltenyi). The freshly isolated xhCRC cells were maintained on collagen-1- 
coated culture plates (BD Biosciences) and cultured in MEM supplemented with 
10% FBS, vitamins (1X), nonessential amino acids (1X), penicillin-streptomycin 
(1X), sodium pyruvate (1X) and L-glutamine (1X). All medium supplements 
were purchased from Sigma. 

Anti-POLR2A antibodies were purchased from Santa Cruz (sc-47701) and 
Abcam (ab140509). Anti-Ki67 antibody (D3B5) and anti-cleaved caspase-3 
(Asp175, 5A1E) antibody were obtained from Cell Signalling. Anti-p53 (sc- 
126), anti-B-actin (sc-1616), horseradish peroxidase (HRP)-anti-goat IgG (sc- 
2020), HRP-anti-rabbit IgG (sc-2054), and HRP-anti-mouse IgG (sc-2055) anti- 
bodies were purchased from Santa Cruz. Cell lysate preparation, SDS-PAGE and 
western blotting were performed as previously described”. 

RNA isolation and quantitative PCR. Total RNA was isolated using TRIzol 
reagent (Life Technologies) and then reverse-transcribed using iScript cDNA 
Synthesis Kit (Bio-Rad). The resulting cDNA was used for qPCR using iTaq 
Universal SYBR Green Supermix (Bio-Rad) with gene-specific primers and the 
results were normalized to f-actin control. In reverse transcription PCR (RT- 
PCR) assays, the primer sequences for TP53 are: 5'-GAGGTTGGCTCTGACTGT 
ACC-3' and 5'-TCCGTCCCAGTAGATTACCAC-3’, and for POLR2A are: 5'- 
TTGTATCCGTACCCACAGCA-3' and 5’-CATGATCAGCTCCCCATTCT-3’. 
shRNA-mediated knockdown of POLR2A. POLR2A-specific shRNA clones 
were obtained from the MD Anderson shRNA and ORFeome Core Facility (ori- 
ginally from Open Biosystems). Twelve shRNAs targeting POLR2A were screened, 
of which two shRNAs knocked down protein levels by at least 50% in all 
four colorectal cancer cell lines tested. Their clone identification numbers and 
shRNA sequences are V3LHS_645674 (5'-TTAGCTTTGTTCTTCCCGA-3’) 
and V3LHS_640293 (5'-TGITGTCCATCTCCTCCCC-3’). The hairpin 
sequences in the GIPZ vector were cloned into the TRIPZ vector (Dharmacon) 
using a protocol provided by the manufacturer. The TRIPZ vector is a Dox- 
inducible system with a red fluorescent protein reporter. 

Generation of cells stably expressing Dox-inducible shRNAs. Recombinant 
lentiviral particles were produced from 293T cells. In brief, 72 1g of shRNA- 
encoding vector DNA, 54 jig of Delta 8.9 vector DNA and 18 ig of VSVG-encod- 
ing vector DNA were transfected into 293T cells (plated in the 245-mm” dish) 
using X-tremeGENE (Roche). Supernatant containing virus particles was collected 
and filtered 72 h after transfection, concentrated by ultracentrifugation at 90,000g, 
and resuspended in cell growth medium. To generate stable Dox-inducible cells, 
HCT116 and SNU283 cells were infected with shRNA-expressing viral particles at 
a multiplicity of infection (MOD) of 1. Viral solutions were added to cell culture 
medium containing 4 jg ml * polybrene. Cells were selected 48 h after infection 
by 2pgml~' puromycin. Single colonies were cultured and propagated, and col- 
onies bearing a single copy of lentiviral DNA insert were identified and analysed 
for knockdown efficiency. 

Direct competition assay using POLR2A shRNA. A single lentiviral copy 
expressing either shRNA-1 or shRNA-2 was sufficient to suppress POLR2A 
expression levels in all the four colorectal cell lines tested (Extended Data Fig. 
3b). Cancer cells were infected with control or POLR2A shRNA-expressing lenti- 
viruses (pGIPZ backbone expressing green fluorescent protein (GFP)) at the MOI 
of 2. Two days after infection, GFP-positive cells were sorted using a BD FACSJazz 
cell sorter (BD Biosciences) at the MD Anderson Flow Cytometry and Cellular 
Imaging Core. Next, GFP-positive cells were mixed with non-infected and GFP- 
negative cells at the ratio of 1:1 and cultured for six passages. The numbers of GFP- 
positive and total cells at each passage were analysed and quantified by flow 
cytometry and the percentages of GFP-positive cells were calculated. 
Generation of sgRNA-expressing vectors and Surveyor assay. Bicistronic 
expression vector expressing Cas9 and single-guide RNA (sgRNA) was digested 
with BbsI and treated with alkaline phosphatase, and then the linearized vector 
was gel-purified for cloning sgRNA-encoding DNA”. The pair of oligonucleotide 


DNA for each sgRNA targeting TP53 or POLR2A was annealed, phosphorylated 
and ligated to the linearized vector. The sequences of oligonucleotide DNA are 
listed in Supplementary Table 1. Surveyor assay was performed to test the genome 
editing efficacy as previously described”””*. In brief, cells were seeded into six-well 
plates at a density of 2 X 10° cells per well. One day after initial seeding, cells were 
transfected with 2 jig of Cas9/sgRNA-expressing vector DNA and collected 48h 
after transfection. Genomic DNA was isolated and a 1-kb DNA fragment contain- 
ing the sgRNA-targeting site was amplified by high-fidelity PCR and digested by 
T7 endonuclease I. Genomic DNA isolated from HCT116 cells transfected with 
control vector DNA was used as control. To allow complementary but mis- 
matched strands to anneal, PCR products were incubated at 95°C for 5 min, 
95°C to 85°C at the rate of —2°Cs ' and 85°C to 25°C at the rate of 
—0.1°Cs_'. T7 endonuclease I was added and samples were incubated at 37 °C 
for 60 min to digest the annealed PCR products at the mismatch sites. T7-endo- 
nuclease-I-digested PCR products were analysed by agarose gel electrophoresis. 
Oligonucleotide sequences used for PCR amplification are listed in Supplementary 
Table 1. The PCR products from positive clones were ligated to pGEM-T Easy 
Vector (Promega) and further confirmed by DNA sequencing. 

Cell proliferation and survival assay. Equal numbers of cells were plated in 12- 
well plates in triplicate. Cells were fixed with 10% methanol and stained with 0.1% 
crystal violet (dissolved in 10% methanol) at indicated times. After staining, wells 
were washed three times with PBS and destained with acetic acid. The absorbance 
of the crystal violet solution was measured at 590 nm. For cell survival assay, cells 
were seeded at a concentration of 1,000 cells per well in 96-well plates and treated 
with indicated concentrations of «-amanitin or actinomycin D 24h later. Cell 
viability was quantified using WST-1 reagent (Roche) according to the manufac- 
turer’s instructions. All experiments were performed in triplicate. 

Apoptosis and cell cycle analysis. The HCT116 and SNU283 cell lines were 
treated with or without «-amanitin for 2 days or doxycycline for 4 days at indicated 
concentrations and stained with annexin-V-phycoerythrin (PE) and 7-AAD 
(Biovision). Apoptosis was analysed by flow cytometry using a Guava EasyCyte 
Flow Cytometer (Millipore) according to the manufacturer’s protocol. Both pre- 
apoptotic (annexin-V-positive and 7-AAD-negative) and apoptotic (annexin-V- 
positive and 7-AAD-positive) cells were included in the analyses. For cell cycle 
analysis, cells were fixed in 75% ethanol at —20°C overnight. The cells were 
washed with cold PBS, treated with 100 pig of RNase A (Qiagen), and stained with 
50g of propidium iodide (Roche). Cell cycle profiles were analysed by flow 
cytometry using the Guava EasyCyte Flow Cytometer (Millipore). 

Fluorescence in situ hybridization. FISH analysis was performed using fluor- 
escein-labelled POLR2A (red) and control centromere (chromosome 17, green) 
probes from Empire Genomics. Hybridization and detection were performed 
according to the manufacturer’s protocols. The slides were counterstained with 
4',6-diamidino-2-phenylindole (DAPI), and the images were captured using a 
Nikon E800 microscope equipped with a cooled-charge coupled device (CCD) 
camera. To determine hemizygous loss of the POLR2A gene, 100 individual nuclei 
were analysed for each case. The interphase nuclei were captured and processed 
using the Quantitative Image Processing System (Applied Imaging). 

Patient samples. Matched normal and colorectal tumour tissue samples from 
patients were obtained from the MD Anderson Cancer Center (MDACC) through 
appropriate informed consents after approval by the Institutional Review Board 
(IRB PA11-0767). To determine the expression levels of POLR2A protein, tissue 
samples (20-40 mg) were placed in tubes containing ceramic beads and were homo- 
genized using a Precellys 24 homogenizer device (Bertin Technologies). The lysates 
were spun-down twice (15 min, 16,000g) and the supernatant was collected. 
Genomic DNA isolation and copy number validation. Total genomic DNA was 
extracted from human tissue specimens and cell lines using DNeasy Blood & 
Tissue Kit (Qiagen) according to the manufacturer’s purification instructions. 
All the DNA samples were stored at —20°C. The copy number variations for 
POLR2A were determined using TaqMan probes (Hs02023849_cn and 
Hs01252684_cn) and standard TaqMan PCR kit on an Applied Biosystems 
7900HT Sequence Detection System. And the reference gene TERT was simulta- 
neously quantified in the same reaction for each DNA sample. 

Conjugation of a-amanitin to anti-EpCAM antibody (HEA125). Antibody- 
drug conjugate ama-~HEA 125 was constructed by coupling of «-amanitin to lysine 
residues of the HEA125 antibody by a stable linker structure. HEA125 binds to 
EpCAM-expressing cells with high affinity (Kq ~ 2.2 X 10~° moll~') and high 
specificity. HEA125 was purified by affinity chromatography using a protein 
A-Sepharose CL-4B column (GE Healthcare). «-amanitin was attached to 
immunoglobulin molecules by a plasma stable urea linkage intended to release 
free o-amanitin inside the tumour cell after lysosomal degradation of the antibody 
moiety. The drug-antibody ratio of the a-amanitin:lgG molecule was 4:1. 
Biochemical characteristics of ama-~HEA125 were evaluated by high performance 
liquid chromatography (HPLC) using a PlatinBlue HPLC system (Knauer). In 
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addition, HEA125 and ama—HEA125 were analysed by reducing SDS-PAGE and 
Coomassie staining according to common procedures. Verification of drug-load- 
ing was done by anti-amanitin immunoblotting analysis of 30 ng HEA125 and 
ama-HEA125 using standard techniques. 

Liposomal nanoparticle preparation. siRNAs for in vivo delivery were encapsu- 
lated into DOPC. DOPC and siRNA were mixed in the presence of excess tertiary 
butanol at a ratio of 1:10 (w/w) siRNA:DOPC. Tween 20 was added to the mixture 
in a ratio of 1:19 Tween 20:siRNA-DOPC. The mixture was vortexed, frozen in an 
acetone/dry-ice bath, and lyophilized. Before in vivo administration, this prepara- 
tion was hydrated with PBS at room temperature at a concentration of 150- 
1,000 1g siRNA per kilogram per injection (each mouse received 200 of 
DOPC-siRNA-PBS solution by the intraperitoneal route). 

Xenograft tumour studies. Four- to six-week-old female NOD/SCID mice were 
purchased from Jackson Laboratories and housed under pathogen-free condi- 
tions. All studies were approved and supervised by the Institutional Animal 
Care and Use Committee of MD Anderson Cancer Center. When used in a power 
calculation, our sample size predetermination experiments indicated that five mice 
per group can identify the expected effect of POLR2A on tumour size and weight 
(P <0.05) with 90% power. Animals were randomly divided to different groups. 
Dox-inducible HCT116 (1 X 10°) and SNU283 (2 X 10°) cells in 50 ul growth 
medium (mixed with Matrigel at 1:1) were injected subcutaneously into the flank 
using a 100-1 Hamilton microlitre syringe. Tumour size was measured every 
3 days using a calliper, and tumour volume was calculated using the standard 
formula: 0.5 X L X W’, in which L is the longest diameter and W is the shortest 
diameter. For orthotopic mouse model, the NOD/SCID mice were anaesthetized 
and the skin was incised to expose caecum. Dox-inducible HCT116 cells (1 X 10°) 
expressing luciferase were injected into the caecal wall using a 100-1 Hamilton 
microlitre syringe, and then the incision was closed using wound clips. Tumours 
were monitored by the IVIS system after luciferin injection for 15 min. After initial 
establishment of tumour (100mm? for subcutaneous implants and 2 Xx 10° 
photons per second, total flux for orthotopic implants), mice were treated with 
1pgml' doxycycline in drinking water for 3-4 weeks. The doxycycline water was 
changed every other day. 

For xenograft tumour studies using DOPC-encapsulated siRNAs, isogenic pairs 
of HCT116 (1 X 10°) cells were transplanted into the caecal wall using a 100-1 
Hamilton microlitre syringe. Ten days after cell injection, mice were randomly 
divided and assigned to receive either control-siRNA-DOPC or POLR2A-siRNA- 
DOPC. The siRNA sequences are as follows: control siRNA (5’-UUCUCCGAAC 
GUGUCACGU-3'’ and 5'-ACGUGACACGUUCGGAGAA-3'); POLR2A 
siRNA-1 (5'-CCAACAUGCUGACAGAUAU-3’ and 5’-AUAUCUGUCAGCA 
UGUUGG-3'); POLR2A siRNA-2 (5'’-CCAAGAAGCGGCUCACACA-3’ and 
5'-UGUGUGAGCCGCUUCUUGG-3’). A dose of 150-1,000 pig siRNA per kilo- 
gram mouse was administrated intraperitoneally at twice weekly intervals. This 
range of concentrations was determined by the Center for RNA Interference and 
Noncoding RNAs at MD Anderson Cancer Center, which ensures efficient deliv- 
ery and knockdown of target genes as previously described”. 

For xenograft tumour studies using ama~HEA125 antibody-—drug conjugate, 
isogenic pairs of HCT116 (1 X 10°), xhCRC (0.5 X 10°), SW480 (1 X 10°) or 
SW837 (2 X 10°) cells were transplanted into the caecal wall using a 100-pl 
Hamilton microlitre syringe. Mice bearing 10-day-old tumours were randomized 
to five groups ( = 10 mice per group) and received two intraperitoneal doses (at 
10 days and 17 days, 1 week apart) of the following: (1) control unconjugated 
HEA125 monoclonal antibody at a dose of 3.6mgkg ' of body weight; (2) 
ama-HEA125 at a dose of 90 ugkg ' in terms of o-amanitin (corresponding to 
3.6mgkg * of IgG); (3) ama-HEA125, 30 ugkg ' in terms of o-amanitin (cor- 
responding to 1.2 mg kg of IgG); (4) ama-HE125, 10 pg kg” * in terms of o-ama- 
nitin (corresponding to 0.4mgkg ' of IgG); and (5) ama-HEA125, 3 ugkg * in 
terms of o-amanitin (corresponding to 0.12mgkg * of IgG). Tumours were 
monitored by the IVIS live imaging system twice a week after luciferin injection 
for 15 min. Body weights were recorded every four days. Blood was obtained by 
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retro-orbital bleeding after anaesthesia on day 21 and levels of blood liver enzymes 
(aspartate amino transferase, amino alanine transferase and alkaline phosphatase) 
were determined at the Clinical Pathology, Veterinary Medicine and Surgery Core 
of MD Anderson Cancer Center. 

The person who provided animal care and measured tumour growth was 
blinded to the group allocation during all animal experiments and outcome assess- 
ment. Mice were euthanized when they met the institutional euthanasia criteria for 
tumour size and overall health condition. Tumours were removed, photographed 
and weighed. The freshly dissected tumour tissues were fixed in 10% buffered 
formalin overnight, washed with PBS, transferred to 70% ethanol, embedded in 
paraffin, sectioned and stained with haematoxylin and eosin. The investigators 
were blinded to the group allocation during the experiment and when assessing 
the outcome. 

Immunohistochemistry and human colon tissue microarray. Colon cancer 
tissue microarray (BC051110a) was purchased from Biomax, including 110 colon 
tumour samples and 10 nomral colon tissue samples. Samples were deparaffinized 
and rehydrated. Antigen was retrieved using 0.01 M sodium citrate buffer (pH 6.0) 
at a sub-boiling temperature for 10 min after boiling in a microwave oven. To 
block endogenous peroxidase activity, the sections were incubated with 3% hydro- 
gen peroxide for 10 min. After 1h of pre-incubation in 5% normal goat serum to 
prevent nonspecific staining, the samples were incubated with antibody against 
POLR2A (sc-47701, Santa Cruz), Ki67, (D3B5, Cell Signaling), or cleaved caspase- 
3 (SA1E, Cell Signaling) at 4°C overnight. The sections were incubated with a 
biotinylated secondary antibody (4Plus Biotinylated anti-mouse or anti-rabbit 
IgG, BioCARE) and then incubated with avidin-biotin peroxidase complex solu- 
tion and developed using a DAB (diaminobenzidine) substrate kit (550880, BD 
Biosciences) according to the manufacturer’s protocol. Counterstaining colour 
was carried out using Harris modified haematoxylin. All immunostained slides 
were scanned on the Automate Cellular Image System III (ACIS III) for quan- 
tification by digital image analysis. 

Bioinformatic analysis. We analysed the correlation between gene copy number 
and the corresponding gene expression using data obtained from CCLE (http:// 
www.broadinstitute.org/ccle) and TCGA (http://www.cbioportal.org/public- 
portal/) as previously described’’. Enrichment of Pearson correlation coefficients 
was determined by permuting gene names. To determine whether a deleted gene 
functions as a housekeeping gene, we first analysed its expression profiles in 
tumour and normal tissues as well as its general functions from literature. 
Second, we checked gene conservation across species and lethality of the gene 
knockout by searching available databases of model organisms (Saccharomyces 
Genome Database, WormBase, FlyBase and Mouse Genome Informatics). Third, 
we checked the proximity of the potential target gene to the TP53 gene and 
analysed its co-deletion with TP53 in human cancers. Finally, we searched for 
available cancer cell lines with the deletion of TP53 and the target gene to test our 
hypothesis. 

Statistical analysis. Each experiment was repeated three times or more. Unless 
otherwise noted, data are presented as mean and s.d. or s.e.m., and Student's t-test 
(unpaired, two-tailed) was used to compare two groups of independent samples. 
In an unpaired ¢-test, we assumed equal variance and no samples were excluded 
from the analysis. Statistical methods used for TCGA data analysis are described 
above. P < 0.05 was considered statistically significant. 


25. Liu, Y. etal. Kaposi’s sarcoma-associated herpesvirus-encoded microRNA miR- 
K12-11 attenuates transforming growth factor beta signaling through 
suppression of SMADS. J. Virol. 86, 1372-1381 (2012). 

26. Cong,L.eta/. Multiplex genome engineering using CRISPR/Cas systems. Science 
339, 819-823 (2013). 

27. Guschin, D. Y. et al. A rapid and general assay for monitoring endogenous gene 
modification. Methods Mol. Biol. 649, 247-256 (2010). 

28. Ran, F. A. et al. Genome engineering using the CRISPR-Cas9 system. Nature 
Protocols 8, 2281-2308 (2013). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 
z gS 
“oO 
58 33 8B 
2 2 
ES BK z 
34 FN £ 
Be 3 a 5 
a ro) z 
a 
b p=0.142 c 
#1 #2 #30 #4 #5 GTB HO 
NTNTN TNT NTNTN TN T N T 


#1 #2 #300 #4 
NT NTN TN T 


Relative POLR2A expression 


Tumour 
POLR2A°5 


Tumour 
POLR2Aneutral 


Normal 
tissue 


Extended Data Figure 1 | Expression of POLR2A correlates with its gene 
copy number in human colon tumours. a, Top, double-colour FISH analysis 
using a probe for chromosome 17 centromere (green) and a locus-specific 
probe for POLR2A (red) on human colon tissue microarray. Bottom, 
immunohistochemical staining of POLR2A in the corresponding tissue 
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samples. Hemizygous loss of the POLR2A gene was determined, and the results 
are shown in Supplementary Table 2. b, Quantification of POLR2A expression 
in human colon normal (n = 7), POLR2A"™™ (n = 43) or POLR2A'°S 

(n = 29) tumour tissue samples. Error bars, s.d. c, Protein levels of POLR2A 
and f-actin in matched normal and CRC tissue samples. 
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Extended Data Figure 2 | Expression of TP53 is not associated with its gene _c, Relative mRNA expression of TP53 in human CRC cell lines (normalized to 
copy number. a, b, Scatterplots of TP53 copy number versus protein that in the HCT116 cell line). Data are mean and s.d. of three independent 
expression (a) or mRNA expression (b) in colorectal tumours in TCGA experiments. 


database. Pearson correlation coefficients (r) and P values are displayed. 
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Extended Data Figure 3 | POLR2A' cells are highly sensitive to POLR2A 
inhibition. a, Cell proliferation of POLR2A"@! and POLR2A'°® cells treated 
with actinomycin D. b, Knockdown efficiency of POLR2A-specific shRNAs in 
HCT116, SW480, SW837 and SNU283 cells. shNT denotes non-targeting 

control shRNA. c, Effect of POLR2A knockdown on the proliferation of four 
colorectal cancer cell lines. Cells expressing GFP and control or POLR2A- 

specific shRNAs were sorted and mixed with control GFP-negative cells (1:1) 
and the GFP-positive cells were quantified at passages 2, 4 and 6. **P < 0.01; ns, 


not significant. d, Protein levels of POLR2A in HCT116 and SNU283 cells 
expressing Dox-inducible POLR2A shRNAs (1.0 pg ml* Dox). e, Cell 
proliferation of HCT116 and SNU283 cells expressing Dox-inducible POLR2A 
shRNA in the presence of 300 ngml_* Dox. **P < 0.01. f, g, Cell cycle profiles 
(f) and apoptosis (g) of control or POLR2A shRNA-expressing HCT116 and 
SNU283 cells. **P < 0.01. Data are mean and s.d. of three independent 
experiments. 
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©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


a c 
Cas9Q + + + CasQ + + 
sgRNA - + + sgRNA = + sgRNA-1 mutant #14 
—__—___—_—_—_—_—_P 
_ 
- 'GGCCGCTGCCAAACATGTGC: 
sgRNA-2 CT AGG 1000 je a ed 4010 bp 4000 = 10100 GGGGCGGCCTCCCTCAGTCGTCTCTGGGTATTTGATGCC §=WT 
700 — i) 701 bp 700 = 774 bp GGGGCGGCCTCCCTCAGTC---CTCTGGGTATTTGATGCC. -2 bp 
500 5 
POLReS 303 = 309bp 300. ame 
B00 200 236 bp  SORNA-2 mutant #5 
sgRNA-1 GGTCTCTGCTGACTCCCTCCGGCG 100 100 = GGACTGGCCGCTGCCAAACATGTGCAGGTAAGTGCTGG }=WT 
—_—_—_ 
(BP) BG]ACTGGCCGCTGCCAAACen-n-n-n-nenenenenenenenee = 
sgRNA-1 sgRNA-2 GGACTGGCCGCTGCCAAAC TecTee -13 bp 
d g Neutral Loss-1 Loss-2 — shRNA-2 
= 50 _ o === shRNA-1 
s = 2 g* >= shNT 
Ss S 2 4 2 40 2 40 
So ¥ F 8 ns 9 8 
i ° 30 © 30 ° 30 
| “i 2 2 g : 
_, oan B 19 B10 B 49 
B-Actin ——_ — a o a 
th a ra rm 
Oo 0 T T T T oO 0 T T T T Oo 0 T T T T 
PO P2 P4  P6 PO P2 P4 PG PO P2 P4 ~~ PG 
Passages Passages Passages 
e f h 
1.2 —- Loss-2 
E87 Loss-2 8 -=- Loss-1 
3 ae hose: aed > Neutral 
® 64 > Neutral 208 
& 3 © oD Sash $8 © 
® 2 Dox 9 % © PO 'ay (ng/ml) Dox 9 % . SoH (ngimi) Dox © 
Q < shNT @@@666 shnT @ rr) 6 shNT 6600 
4g 2 shRNA-1 e9ee8s shRNA-1| 1S shRNA-1 eens 
S 3 shRNA-2 © O64 O68 shRNA-2 © TY Tole) ®, shRNA-2 60558 a 
2 a, Neutral Loss-1 Loss-2 


9 oY Oo 2 19 


seHxFes & oO” oF oF oF AS 


Days after seeding 


Actinomycin D (ug/ml) 


Extended Data Figure 5 | Mono-allelic knockout of POLR2A sensitizes 
HCT116 cells to POLR2A inhibition. a, Schematic illustration of the Cas9/ 
sgRNA-targeting sites in the POLR2A gene. Two single-guide RNA (sgRNA)- 
targeting sequences are shown and the protospacer-adjacent motif (PAM) 
sequences are highlighted in red. b, Efficiency of the Cas9-mediated cleavage of 
POLR2A in HCT116 cells measured by the Surveyor assay. c, Sequences of 
mutant POLR2A alleles in the cell colonies 14 and 5. PAM sequences are 
highlighted in red. Small deletions in the targeted region led to open reading 
frame shift, producing only a short stretch of the amino-terminal peptide 


without any functional domains of POLR2A. d, Protein levels of POLR2A in 
POLR2A"™** and POLR2A'°* HCT116 cells. e, Growth curves of 
POLR2A"™*! and POLR2A!°** HCT116 cells. f, Relative proliferation of 
POLR2A"™™@ and POLR2A'°® cells treated with actinomycin D. g, Effect of 
POLR2A knockdown on the POLR2A"™ and POLR2A'°°S HCT116 cells. 
Experiments were performed as described in Extended Data Fig. 3c. 

**P < 0.01. h, Dox-induced partial suppression of POLR2A inhibited the 
growth of POLR2A’°s HCT116 cells, but not of parental POLR2A"™™ 
HCT116 cells. Data are mean and s.d. of three independent experiments. 
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Extended Data Figure 6 | Sensitivity of POLR2A’°™ cells to POLR2A 
inhibition is independent of p53. a, Schematic illustration of the Cas9/ 
sgRNA-targeting sites in the TP53 gene. Two sgRNA-targeting sequences are 
shown and the PAM sequences are highlighted in red. b, Efficiency of the Cas9- 
mediated cleavage of TP53 in HCT116 cells measured by Surveyor assay. 

c, Protein levels of POLR2A and p53 in a panel of isogenic HCT116 cells. 


d, Growth curves of POLR2A""" and POLR2A' xhCRC cells. e, Growth 
curves of POLR2A"™"* and POLR2A°s HCT116 cells. f, g, Crystal staining 
images (f) and cell survival curves (g) of POLR2A7™ and POLR2A!°ss 
HCT116 cells treated with o-amanitin. h, Cell survival curves of POLR2A"™™ 
and POLR2A’°s HCT116 cells in response to the treatment of ama-HEA125. 
Data are mean and s.d. of three independent experiments. 
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Extended Data Figure 7 | Dose-dependent suppression of POLR2A inhibits 
tumorigenesis in POLR2A’°™, but not POLR2A"™"™ tumours. 

a, Quantification of POLR2A mRNA expression levels in subcutaneously 
xenografted HCT116 and SNU283 tumours expressing control or POLR2A 
shRNA (n= 5 mice per group). **P < 0.01. Data are mean and s.d. 

b, Immunohistochemical staining of the aforementioned xenograft tumours. 
HE, haematoxylin and eosin. ¢, Cells positive for Ki67 (cell proliferation) or 
cleaved caspase-3 (apoptosis) per field and POLR2A expression in b were 
quantified. **P < 0.01. n = 10 fields. Data are mean and s.d. d, Gross tumour 


images of xenograft tumours derived from subcutaneously implanted 
POLR2A7* and POLR2A'°s HCT116 cells (1 X 10° cells injected). Both cell 
lines express control or Dox-inducible POLR2A shRNAs. After the initial 
establishment of tumours (100 mm*), mice were treated with Dox (0.5, 1 and 
2 wg ml’) in drinking water. n = 5 mice per group. e, Quantification of tumour 
sizes as shown in d. Data are mean and s.d. f, Representative bioluminescent 
images of orthotopically implanted HCT116 tumours expressing Dox- 
inducible control or POLR2A shRNA after Dox treatment. 
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Extended Data Figure 8 | Suppression of POLR2A with DOPC- 
encapsulated POLR2A siRNA inhibits the growth of POLR2A’°“ tumours. 
a, Protein levels of POLR2A after transfection of control siRNA or POLR2A 
siRNAs (shPol2-1 and shPol2-2) in HCT116 cells. b, Schematic illustration of 
orthotopic injection of HCT116 cells (1 X 10° cells) followed by siRNA-DOPC 
nanoliposome treatment. c-f, Representative bioluminescent images (c, e) and 
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tumour growth curves (d, f) of orthotopic xenograft tumours derived from 
POLR2A"™™ and POLR2A°s HCT116 cells that received intraperitoneal 
injections of control (1,000 1g kg” ') or POLR2A siRNAs (125, 250, 500 and 
1,000 pg kg *) twice weekly. n = 10 mice per group. Error bars, s.e.m. 

g, h, Representative protein levels of POLR2A in xenograft tumours after 
control or POLR2A siRNA treatment. 
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Extended Data Figure 9 | Suppression of POLR2A selectively inhibits the 
POLR2A'° tumour growth. a, Immunohistochemical staining of 
xenografted xhCRC tumours. b, c, Tumour weights of orthotopically implanted 
HCT116 (b) and xhCRC (c) tumours. 1 = 10 mice per group. Data are mean 


and s.d. d, e, Body weights (d) and liver enzymes (e) including alanine 
aminotransferase (ALT), aspartate aminotransferase (AST) and alkaline 
phosphatase in peripheral blood. Data are mean and s.d. n = 5 mice. 
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Extended Data Figure 10 | Suppression of POLR2A by ama-HEA125 
inhibits the growth of POLR2A °* tumours. a, d, g, Protein levels of POLR2A 
in HCT116 (a), SW480 (d) or SW837 (g) cells. These cell lines are 
POLR2A""™", POLR2A'°S or POLR2A-restored. b, ¢, e, f, h, i, Representative 
bioluminescent images (b, e, h) and tumour growth curves (c, f, i) of orthotopic 


aie tumours derived from the corresponding cells as indicated. All of 
them received dual intraperitoneal injections of anti-EpCAM antibody 
(3.6mgkg ') or ama-HEA125 antibody-drug conjugate (10 and 90 ugkg |, 
corresponding to 0.4 and 3.6 mg IgG kg"). n = 10 mice per group. Error bars, 
s.e.m, 
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Structural basis of CpG and inhibitory DNA 
recognition by Toll-like receptor 9 


Umeharu Ohto!, Takuma Shibata??, Hiromi Tanji', Hanako Ishida’, Elena Krayukhina*®, Susumu Uchiyama‘, Kensuke Miyake? 


& Toshiyuki Shimizu’? 


Innate immunity serves as the first line of defence against invading 
pathogens such as bacteria and viruses’. Toll-like receptors (TLRs) 
are examples of innate immune receptors, which sense specific molec- 
ular patterns from pathogens and activate immune responses’. 
TLR9 recognizes bacterial and viral DNA containing the cytosine- 
phosphate-guanine (CpG) dideoxynucleotide motif**. The molecular 
basis by which CpG-containing DNA (CpG-DNA) elicits immuno- 
stimulatory activity via TLR9 remains to be elucidated. Here we show 
the crystal structures of three forms of TLR9: unliganded, bound to 
agonistic CpG-DNA, and bound to inhibitory DNA (iDNA). Agonistic- 
CpG-DNA-bound TLR9 formed a symmetric TLR9-CpG-DNA 
complex with 2:2 stoichiometry, whereas iDNA-bound TLR9 was a 
monomer. CpG-DNA was recognized by both protomers in the dimer, 
in particular by the amino-terminal fragment (LRRNT-LRR10) from 
one protomer and the carboxy-terminal fragment (LRR20-LRR22) 
from the other. The iDNA, which formed a stem-loop structure suit- 
able for binding by intramolecular base pairing, bound to the concave 
surface from LRR2-LRR1O. This structure serves as an important 
basis for improving our understanding of the functional mechan- 
isms of TLR9. 

The immunostimulatory activity of CpG-DNA is affected by its 
length, the number of CpG motifs, and the sequences flanking the CpG 
motif. The core CpG motif, which consists of a hexamer with a central 
unmethylated CpG, has the general formula RRCGYY (where R repre- 
sents a purine and Y a pyrimidine)°. Unlike other TLRs, TLRs 7-9 each 
have a long insertion loop (Z-loop) between LRR14 and LRR1S5 (Ex- 
tended Data Fig. 1). Several recent studies have shown that proteolytic 
processing at the Z-loop is necessary for the creation of functional 
TLR9®"*. To reveal the molecular mechanism by which TLR9 specif- 
ically recognizes CpG-DNA and sends signals to the intracellular com- 
partment, we performed biochemical, biophysical and crystallographic 
studies of the agonistic- and antagonistic- DNA-binding modes. 

Wescreened recombinantly expressed extracellular domains of TLR9 
from various species. Of the proteins we examined, the purified yields 
of the horse (Equus caballus; Ec), bovine (Bos Taurus; Bt) and mouse 
(Mus musculus; Mm) orthologues (Extended Data Fig. 1) were suffi- 
cient for crystallographic study. We used DNA1668_12mer derived from 
20-mer single-stranded DNA1668° as agonistic DNA (Extended Data 
Fig. 2a), and iDNA4084"' and iDNA_super as iDNAs (Fig. 1a). DNA1668 
and iDNAs (i{DNA4084 and iDNA_ super) function as an agonist and 
antagonist, respectively, for both horse and bovine TLR9 (Extended Data 
Fig. 2b). Unless otherwise noted, the DNAs used in this study were 
single-stranded. Both DNA1668_12mer and iDNA4084 were able to 
bind Z-loop-processed and -unprocessed TLR9, demonstrating that the 
binding of both types of DNA is independent of Z-loop processing 
(Fig. 1b, c). Z-loop-unprocessed TLR9 remained mostly monomeric, 
irrespective of the presence of agonistic DNA1668_12mer, whereas the 
proportion of Z-loop-processed TLR9 present as a dimer increased 


significantly in the presence of DNA1668_12mer (Fig. 1d and Extended 
Data Fig. 3). These results indicate that the Z-loop is functionally sig- 
nificant in the ligand-dependent oligomerization of TLR9. By contrast, 
DNA1668_12mer_GC (DNA1668 with the CpG motif swapped for 
GpC) and iDNA4084 could induce the formation of only small quant- 
ities of TLR9-DNA dimers (Fig. 1d). 

We determined the crystal structures of the unliganded, agonistic- 
DNA-bound, and iDNA-bound forms of Z-loop-processed TLR9 molec- 
ules (Extended Data Table 1 and Extended Data Fig. 4). In the crystals, 
the unliganded and iDNA-bound forms of TLR9 are monomeric 
(Fig. 2a, b), whereas the agonistic-DNA1668_12mer-bound forms of 
TLR9 are dimeric (Fig. 2c and Extended Data Fig. 5). Similar to other 
TLRs'*”, the agonistic-DNA-bound forms form an m-shaped 2:2 com- 
plex in which the C-termini of the two TLR9 protomers are positioned 
in the centre (Fig. 2c and Extended Data Fig. 5d). 

The agonistic DNA binds to two equivalent positions in the dimer, 
and each DNA1668_12mer is recognized by both TLR9 and TLR9* 
(The second TLR9 within the dimer and its residues are indicated with 
asterisks) in a bent conformation (Fig. 2c and Extended Data Fig. 6). 
DNA1668_12mer winds around the N-terminal fragment of TLR9 from 
the ascending lateral face to the concave face to interact with a region 
spanning from LRRNT to LRR1O (interface 1). This binding region is 
consistent with the results of previous reports’*’°. Simultaneously, 
DNA1668_12mer interacts with the loop regions from LRR20-22 in 
the C-terminal fragment of TLR9* (interface 2) (Extended Data Fig. 6b). 
Thus, agonistic DNA acts as ‘molecular glue’ to bridge the two TLR9 
molecules. This structural feature strongly suggests that only single- 
stranded DNA can act as an agonist. Accordingly, double-stranded DNAs 
containing the CpG motif had greatly reduced affinity (Extended Data 
Fig. 7 and Extended Data Table 2). 

In DNA1668_12mer, the G4-T9 sequence corresponding to the con- 
sensus hexamer of GACGTT is mainly recognized by TLR9 as opposed 
to the other part of DNA. The bases of the CpG motif are accommo- 
dated in the groove formed by LRRNT, LRR1 and LRR2 in the ascend- 
ing lateral face of TLR9 (Fig. 3a). The CpG motif and the flanking bases 
are recognized via interactions with multiple amino acids, as well as via 
water-mediated hydrogen bonds (Fig. 3a). The C6 moiety in the CpG 
motif forms direct hydrogen bonds: the cytosine O2 atom with Met106 
N and Serl04 Oy and the cytosine N3 atom with Ser104 Oy. In addi- 
tion, the cytosine N4 atom makes water-mediated (W1 and W2) hydro- 
gen bonds with His76, Pro99 and Phe108, and the cytosine ring itself 
is wedged between Pro105, Phe108 and a neighbouring CpG guanine 
(G7) (Fig. 3a). Together with Trp47 and Phe49, Cé forms a three-walled 
cage that accommodates the neighbouring G7. The G7 N2 atom engages 
in hydrogen bonds with the Trp96 O and O4 atom of T9 at the +2 
position, anchoring the guanine ring to the bottom of the CpG binding 
groove. Because the N2 atom is unique to guanine these interactions 
define the specificity for guanine in the CpG motif. The thymine ring 


1Graduate School of Pharmaceutical Sciences, The University of Tokyo, Hongo, Bunkyo-ku, Tokyo 113-0033, Japan. “Division of Innate Immunity, Department of Microbiology and Immunology, Laboratory 
of Innate Immunity, Center for Experimental Medicine and Systems Biology, The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan. °Core 
Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency (JST), Saitama 332-0012, Japan. *Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, 
Suita, Osaka 565-0871, Japan. 5U-Medico Corporation, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan. 


702 | NATURE | VOL 520 | 30 APRIL 2015 


©2015 Macmillan Publishers Limited. All rights reserved 


DNA1668_12mer 


DNA1668_12mer_GC 


s2=1 4142 
5’ cCla2p3Gtascoc pep9clociipi2 3/ 


5’ Cla2T3Gtascoc™T8pIcloclipi2 3’ 


b 


100 kDa 
— 
75 kDa 


y 2 


1 EcTLR9_Z-loop unprocessed 


2 EcTLR9_Z-loop processed 


LETTER 


Figure 1 | DNA binding to TLR9 is independent 
of Z-loop processing, but subsequent 
oligomerization is dependent on processing. 

a, DNA sequences used in this study. The DNA 
regions involved in the intramolecular base pairing 
in iDNA4084 and iDNA_super are underlined. 

b, SDS-PAGE analysis of ECTLR9 with 
unprocessed and processed Z-loops. ¢, Gel- 
filtration chromatography of EcTLR9. Black line 
and dashed line denote absorption at 280 nm 

and 260 nm, respectively. The ratios of absorbances 
at 260 nm and 280 nm are shown in parentheses. 
d, The oligomerization states of ECTLR9 analysed 
by SV-AUC. The original c(s) distributions 

were normalized against the height of the main 
peak (see Methods). kDa, kilodalton; S, Svedberg 
(unit of sedimentation coefficient). 
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at the +1 position (T8) stacks with Trp47, and the thymine ring at the 
+2 position (T9) is inserted into the CpG-binding groove and sand- 
wiched between Trp47 and Trp96. T9 also forms hydrogen bonds to 
Ser72 and G7. The adenine ring at the -1 position (A5) stacks with 
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Phe108, and in turn G4 (-2 position) stacks onto A5 (Fig. 3a). Because 
purine bases can form more extensive contacts than pyrimidine bases, 
the purine-purine sequence preceding the CpG motif is favoured by 
this three-layered stacking interaction revealed by gel-filtration ana- 
lysis (Extended Data Fig. 8). The backbone phosphates of G4 and A5 
are recognized electrostatically by the positively charged side chains of 
Lys51, Arg74, His76 and His77. The mutation of several residues (Trp47, 
Trp96 and Phe108) important for the interaction with DNA resulted in 
a protein with dramatically reduced binding affinity (Extended Data 
Table 2), in agreement with the results of the structural analysis. 

Interface 2 also plays an important role in the dimerization of TLR9. 
Residues of interface 2 engage in four hydrogen bonds and several van 
der Waals interactions (Fig. 3b, c). In contrast to interface 1, which 
recognizes the base moieties of the CpG motif, LRR20*-LRR22* from 
the C-terminal fragment primarily recognize the backbone of CpG motif- 
containing DNA. The insertion loops in the ascending lateral surface of 
LRR2, LRR5, LRR8, LRR11, LRR18* and LRR20*, which is character- 
istic of the TLR7-9 family, are involved in protein-protein interactions 
with complementary shapes (Fig. 3b, d). 

We mutated the residues important for the recognition and exam- 
ined the ability to activate NF-«B (Fig. 3e). Most of the mutants exhib- 
ited reduced or completely abolished activation of NF-«B signalling in 
response to DNA1668. These results clearly demonstrate that inter- 
faces 1 and 2 are both important for the functional integrity of TLR9. 
We converted the CpG motif into GC, UG, TG and CA and employed 
isothermal titration calorimetry (ITC) to determine the affinity of TLR9 


Figure 2 | Structures of TLR9. a, Monomer structure of unliganded ECTLR9 
showing the lateral face (left) and the convex face from the N-terminal side 
(right). The N-glycan residues are shown in stick representations with their O, 
N and C atoms coloured red, blue and grey, respectively. The N- and 
C-terminal halves of EcTLR9 are coloured light green and orange, respectively. 
b, Structure of EcTLR9 in complex with iDNA4084 showing the lateral face 
(left) and the concave face of the N-terminal half (right). Bound iDNA4084 is 
shown in stick representation and semi-transparent surface representation. 
The 5’ and 3’ ends of the DNA are indicated. c, Dimer structure of the ECTLR9- 
DNA1668_12mer complex. EcTLR9 and its dimerization partner EcCTLR9* are 
shown in green and cyan, respectively. The second TLR9 within the dimer and 
its residues are indicated with asterisks. The DNA1668_12mer molecules are 
shown in stick representation and semi-transparent surface representation. 
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Figure 3 | Agonistic-CpG-DNA recognition by TLR9. a, Left, CpG-DNA 
binding groove formed by LRRNT, LRR1 and LRR2. Middle, overview of 
CpG-DNA recognition by TLR9 (EcCTLR9_DNA1668_12mer interface 1). 
Right, magnified view of the CpG recognition. The simulated omit electron 
densities are shown at a contour level of 3.00. b, Magnified view of 
EcTLR9_DNA1668_12mer in interface 2. c, Schematic summary of CpG-DNA 
recognition. Dashed lines in red and black depict hydrogen bonds and van der 
Waals interactions, respectively. d, Magnified view of the protein-protein 


for the resultant DNAs. The dissociation constant (Kg) values were 20 nM 
(CG), 569 nM (GC), 54nM (UG), 163 nM (TG) and 883 nM (CA), re- 
spectively (Extended Data Table 2), demonstrating that the CG sequence 
is important for binding. The mutations of the CpG motif reduced 
affinities for TLR9: several interactions were disrupted by swapping CG 
for GC, whereas the direct interactions between G7 (N2) and Trp96 (O) 
and T9 (O04) were lost by the conversion of CG into CA. C6 (N4) forms 
a hydrogen bond with the water molecule (W1) that makes hydrogen 
bonds with Pro99 O and Phe108 O. The substitution of UG for CG would 
be unfavourable because W1 is surrounded by three hydrogen bond 
acceptors. The conversion of CG into TG would result in a weak affin- 
ity for the same reason as UG, and the methyl group might further 
weaken an affinity, possibly by disrupting water molecule clusters; the 
importance of water-mediated interactions for the recognition of meth- 
ylated DNA in MeCP2 having been previously identified”. We also 
employed ITC to assess the pH-dependence of CpG-DNA binding to 
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interface. e, NF-«B activation of mouse TLR9 mutants stimulated by agonistic 
DNA1668 (1 11M). Residues in parenthesis are derived from horse TLR9. ‘Ctrl’ 
represents the empty vector without mouse TLR9 cDNA. f, The NF-«B 
activation of mouse TLR9 by DNA1668, DNA1668_GC or DNA1668_met 
in HEK293T cells. Ctrl, no stimulation. In e and f, data from triplicate 
experiments are shown as mean + s.d. and two-tailed Student’s t-test was used 
to evaluate the statistical significance (**P < 0.01). 


TLR9. Binding affinity decreased as pH increased, with Ky values ran- 
ging from 20nM at pH6.0 to 2500 nM at pH 8.0, revealing that the 
interaction was stronger under acidic than basic conditions (Extended 
Data Table 2). Consistent with this, the structural study revealed that 
His residues are concentrated around the DNA-binding region, result- 
ing in a higher-affinity interaction under acidic conditions (Extended 
Data Fig. 6c). In addition, we examined the binding affinity of methy- 
lated CG (DNA1668_12mer_met) for TLR9. The K, value for this in- 
teraction was 50nM (Extended Data Table 2), demonstrating that 
methylated CG yields weaker binding. Accordingly, sedimentation ve- 
locity analytical ultracentrifugation (SV-AUC) analysis demonstrated 
that methylation of the CpG motif exhibited a reduced ability to dimer- 
ize (Fig. 1d). Consistent with this, DNA1668_met exhibited reduced 
activation (Fig. 3f). 

Wealso determined the crystal structures of TLR9 bound to iDNAs 
(iDNA4084 and iDNA_super) (Fig. 2b and Extended Data Table 1). 


Figure 4 | Recognition of iDNA by TLR9. 

a, Magnified view of iDNA4084 bound to EcTLR9. 
b, Schematic summary of iDNA recognition. 
Dashed lines in red and black depict hydrogen 
bond and van der Waals interactions, respectively. 
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The binding site for iDNA partially overlaps with the binding site for 
agonistic DNA (Extended Data Fig. 6a, b), and this overlap between 
binding sites accounts for the antagonistic effect of iDNA. Of particu- 
lar interest, both iDNA4084 and iDNA_ super in complex with TLR9, 
interacting with LRR2-LRR11, form stem-loop structures that fit snugly 
into the interior of the ring structure of TLR9 (Fig. 4). The stem-loop 
structure of the iDNAs is formed by intramolecular base pairing be- 
tween C1-C2-T3 and T7-G8-G9 (two GC pairs and one TT mismatch 
pair) in the iDNA4084 complex, and between C1-C2-T3-C4 and G13- 
A14-G15-G16 (three GC pairs and one AT pair) in the iDNA_super 
complex (Fig. 4 and Extended Data Fig. 4c-e). The length of the loop 
seems to be immaterial: iDNA_super, which has a long loop, binds sim- 
ilarly to iDNA4084, which has a short loop. The recognition of iDNAs 
is primarily mediated via the DNA backbone. Also, the base at the 
cohesive-end position (G10 in iDNA4084) was directly recognized (Fig. 4 
and Extended Data Fig. 5f). 

The structures of the unliganded, agonistic CpG-DNA-bound and 
iDNA-bound forms of TLR9 described in this study reveal the struc- 
tural basis of CpG-DNA recognition and signalling by TLR9, as well as 
the inhibitory mechanism of iDNA. These results will contribute to the 
development of therapeutic agents that target TLR9. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Protein expression, purification and crystallization. The DNA encoding the ex- 
tracellular domain of Toll-like receptor 9 (TLR9) from various species (human 
(Q9NR9, residues 25-818, 100%), monkey (F6UZJO, residues 26-817, 95.7%), horse 
(Q2EEYO, residues 26-817, 83.6%), bovine (Q866B2, residues 25-815, 77.9%), pig 
(SSR6VO, residues 25-816, 80.4%), rat (MORAAS, residues 26-818, 71.5%), mouse 
(Q9EQU3, residues 26-818, 73.4%) and zebrafish (B3DJW3, residues 23-844, 38.1%)), 
where values in parentheses corresponds to the Uniprot accession number, the 
region and sequence identity (versus human) of the extracellular domain, with a 
C-terminal thrombin cleavage site followed by protein A tag were inserted into the 
expression vector pMT/BiP/V5-His of the Drosophila Expression System. For the 
preparation of crystallization samples of MmTLR3, a total of seven mutations 
(N200Q, N242Q, N309Q, N495Q, N568Q, N695Q and N752Q) were introduced 
to produce the protein with reduced glycosylation sites. Drosophila S2 cells were 
co-transfected with the TLR9 and pCoHygro vectors. Stably transfected cells were 
selected in Sf-900 II SEM medium containing 300 ig ml" hygromycin. Z-loop pro- 
cessing of TLR9 is important for its function® '°. Therefore, to mimic the Z-loop 
processing that occurs in the cell, the purification protocol included V8-protease 
treatment, which yielded the Z-loop-cleaved product. After proteolytic processing, 
the N- and C-terminal halves of TLR9 remained associated in subsequent purifi- 
cation steps. Protein secreted to the supernatant was captured by IgG Sepharose 6 
Fast Flow (GE healthcare) equilibrated with phosphate buffered saline (PBS), washed 
with ten column volumes of PBS, and eluted by 0.1 M glycine-HCl pH 3.5 and 0.15 M 
NaCl. Eluent was immediately neutralized by adding with 1/20 volume of 1 M Tris- 
HCl pH 8.0 and was concentrated to 5-10 mg ml ' and further purified by Superdex 
200 gel filtration chromatography equilibrated with 10 mM Tris-HCl pH 7.5 and 
0.15 M NaCl. For BtTLR9 and EcTLR9, concentrated TLR9 was added with 1/10 
volume of 1 M Na-acetate pH 5.0 and incubated overnight at room temperature 
with 1-2 U per mg of protein endo Hr (New England Biolabs) for saccharide 
trimming. Monomeric fractions from Superdex 200 were collected and was incu- 
bated with 1/20-1/50 (w/w) V8 protease (Wako) for 12 to 48 h to cleave the Z-loop 
and protein A tag. Z-loop-processed TLR9 was further purified by HiTrap SP (GE 
healthcare) cation exchange chromatography. The column was equilibrated with 
10 mM Mes pH 6.0 and 0.1 M NaCl and the bound protein was eluted by a linear 
gradient from 0.1 to 0.7 M NaCl. 

For the crystallizations, purified TLR9 was concentrated to 4.0-6.8 mg mI in 
10 mM Tris (pH 7.5), 150 mM NaCl. To prepare the DNA complex of TLR9, the 
protein solutions were combined with an approximately twofold excess of DNAs 
(DNA1668_12mer, iDNA4084 and iDNA_ super). Crystallization experiments 
were performed with sitting-drop vapour-diffusion methods at 293 K. The crys- 
tallization droplets were made by mixing the equivolume of protein solution and 
reservoir solution, typically around 0.5-2.0 kl, except in the case of the ECTLR9- 
DNA1668_12mer complex where the protein solution and reservoir solution were 
mixed with to a 3:1 ratio. Corresponding to the observed pH dependency of TLR9, 
the crystals of agonistic forms of TLR9 complexed with DNA1668_12mer were 
obtained only in acidic conditions (pH 5.8 and pH'5.5 for horse and bovine TLR9, 
respectively), while the crystal of unliganded and iDNA bound forms of TLR9 were 
obtained in wide range of pH (4.5-8.0). The crystallization conditions are sum- 
marized in Extended Data Table 1. 

Data collection and structure determination. Diffraction data sets were collected 
on beamlines PF-AR NE3A (Ibaraki, Japan) and SPring-8 BL41XU (Hyogo, Japan) 
under cryogenic conditions at 100 K. Crystals were soaked into cryoprotectant solu- 
tion summarized in Extended Data Table 1 and then flash-cooled under a cold gas 
stream. The diffraction data sets were processed using the HKL2000 package” or 
imosflm”. The initial phases for the unliganded form of MmTLR9 were determined 
with the molecular replacement method by using the program Molrep” with the 
coordinates of the human TLRS structure (PDB ID: 3W3J)'®. The model was fur- 
ther refined with stepwise cycles of manual model building using the COOT pro- 
gram” and restrained refinement using REFMAC” or phenix.refine” until the R 
factors converged. The EcTLR9 and BtTLR9 structures were determined by the 
molecular replacement method using the Molrep” program using the refined 
MmTLR9 structure. Ligand molecules, N-glycans and water molecules were mod- 
elled into the electron density maps at the latter cycles of the refinement. The qual- 
ity of the final structure was evaluated with MolProbity””. In the structures of ECTLR9 
(unliganded), ECTLR9-DNA1668_12mer, ECTLR9-iDNA4084, BtTLR9-DNA1668_ 
12mer, MmTLR9 (unliganded), MmTLR9-iDNA4084 (form1), MmTLR9-iDNA4084 
(form2) and MmTLR9-iDNA_ super, 100%, 100%, 98%, 99%, 99%, 99%, 100% and 
99% of the residues were in Ramachandran favoured or allowed regions, respectively. 
The statistics of the data collection and refinement are summarized in Extended Data 
Table 1. The figures representing structures were prepared with PyYMOL”*. 

Isothermal titration calorimetry. ITC experiments were performed at 298 K ina 
buffer composed of 50 mM phosphate buffer pH 6.0-8.0, 250 mM NaCl using a 
MicroCal iT C299 (GE Healthcare). DNAs at a concentration of 50 UM were titrated 


into 5 uM of wild-type or mutant horse TLR9. The titration sequence included a 
single 0.4 1] injection followed by 19 injections, 2 11 each, with a spacing of 120s 
between the injections. OrigineLab software (GE Healthcare) was used to analyse 
the raw ITC data. Thermodynamic parameters were extracted from curve fitting 
analysis with a single-site binding model. 
Analytical ultracentrifugation sedimentation velocity. SV-AUC analyses were 
performed in a ProteomeLab XL-I analytical ultracentrifuge (Beckman Coulter) 
equipped with a 4-hole An60Ti rotors at 20°C using Beckman Coulter 12-mm 
double-sector charcoal-filled epon centerpieces and sapphire windows. The scan- 
ning at 42,000 r.p.m. was performed as quickly as possible between 6.0 and 7.2 cm 
from the axis of rotation with a radial increment of 30 um. To analyse the dimer- 
ization induced by DNA binding, horse TLR9 Z-loop-unprocessed and Z-loop- 
processed samples were run at a loading concentration of 20 1M with or without 
equimolar concentrations of DNAs (Fig. 1d). To analyse the concentration depen- 
dence of the dimerization, the AUC measurements were performed at protein 
concentrations ranging from 1.511M to 301M in the presence of equimolar 
DNA1668_12mer (Extended Data Fig. 3). All sedimentation velocity experiments 
were conducted in a buffer containing 10 mM MES and 250 mM NaCl at pH5.5. 
The sedimentation coefficient distributions were obtained using the c(s) method 
of SEDFIT”. The sedimentation coefficients ranging from 0.1 to 50 S with a loga- 
rithmically spaced grid and resolution of 500 were used. The frictional ratio, menis- 
cus, radial and time-invariant noise were floated during the fitting procedure, anda 
regularization level of 0.68 was used. The partial specific volume, the buffer den- 
sity and viscosity were calculated using the program SEDNTERP 1.09 and were 
0.7407 cm” g', 1.00852 g ml‘ and 1.0256 cP, respectively. The percentages of mono- 
mer and dimer were calculated by dividing the corresponding peak area by the 
sum of the areas under two peaks. 
NF-«B-dependent luciferase reporter assay. To check mouse TLR9 response, 
HEK293T cells were seeded in collagen-coated 6-well plates at a density of 5 X 10° 
cells per well, and transiently transfected with wild-type or mutant mouse TLR9 
cDNAs in pMX-puro-IRES-rat CD2 (1 ug), together with wild-type mouse Unc93B1 
cDNA in pMX-puro (0.5 ig) and a pELAM1-luc reporter plasmid (5 ng), using PEI 
(Polyethylenimine “Max”, MW40,000 ; Polysciences, Inc.) at 36h before stimu- 
lation. To check horse and bovine TLR9 response, HEK293T cells were plated on 
collagen-coated 10-cm dishes at a density of 6 X 10° cells per well, and transiently 
transfected with wild-type horse or bovine TLR9 cDNAs in pMX-puro-IRES-rat 
CD2 (3 1g), together with wild-type human Unc93B1 cDNA in pMX-puro (6 Lg) 
anda pELAM1-luc reporter plasmid (30 ng), using PEI at 30 and 24h before stim- 
ulation. The NF-«B luciferase reporter plasmid, pELAM1-luc, was provided by T. 
Muta (University of Tohoku, Japan)**. Twenty-four hours after first transfection, 
cells were reseeded in collagen-coated flat 96-well plates (Corning) at a density of 
1X 10° cells per well. Then, after pre-culture for 4~6 h, attached cells were stimu- 
lated with various DNAs or 100 ng mI! recombinant human TNF-o. (Wako Pure 
Chemical Industries) for 6h. Stimulated cells were lysed by 40 kl of Cell Culture 
Lysis Reagent (Promega) and 6 il of lysate was subjected to a luciferase assay using 
the Luciferase Assay System (Promega). The relative light unit (RLU) of chemi- 
luminescence was measured by GloMax 96 Microplate Luminometer (Promega). 
Since retroviral vector pMX-puro-IRES-rat CD2 allowed for indirect validation of 
inserted cDNA expression by checking rat CD2 expression, transfection efficiency 
of wild-type and mutant mouse TLR9 cDNAs in HEK293T cells was evaluated by 
examination of cell-surface rat CD2 expression level using a FACSCalibur flow 
cytometer (BD Biosciences). The activity of pELAM1-luc in each transfected 
HEK293T cells was also verified by stimulation of recombinant human TNF-a. 
Oligonucleotide. Oligonucleotides for the gel-filtration, ITC, AUC and crystal- 
lographic analyses were single-stranded DNAs with normal phosphodiester link- 
age unless otherwise noted. Oligonucleotides for the luciferase reporter assay were 
single-stranded DNAs with phosphorothioate linkage. Phosphorothioate DNAs 
(DNA1668, DNA1668_GC, DNA1668_12mer and DNA1668_12mer_GC) were 
purchased from FASMAC (Kanagawa, Japan). Other phosphorothioate DNAs 
(iDNA4084, iDNA_super and DNA1668_met) and all phosphodiester DNAs were 
purchased from Eurofins MWG Operon (Ebersberg, Germany). 
Statistical analysis. Data from triplicate samples in Fig. 3e, 3f and Extended Data 
Fig. 2 were shown as mean + s.d. and subjected to statistical analysis. Statistical 
significance was determined by two-tailed Student’s t-tests. A P value of less than 
0.01 was considered to be significant. No statistical method was used to predeter- 
mine sample size. 
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Signal Sequence LRR15 xLxxLxLxxN 

Human TLRQ MGFCRSALHPLSLLVQAIMLAMTLA 25 Human TLR9 LNFTLDLSRNNLVTVOPEMFAQLS 496 
Horse TLR9 MGPCHGALOPLSLLVQAAMLAVALA 25 Horse. TLR9 LSFTLDLSRNNLVTVQPEMFAQLS 496 
Bovine TLR9 MGP-YCAPHPLSLLVQAAALAAALA 24 Bovine TLR9 LNFTLDLSRNNLVTIQQEMFTRLS 494 
Mouse TLR9 MVLRRRTLHPLSLLVQAAVLAETLA 25 Mouse _ TLR9 FKFTMDLSRNNLYTIKPEMEVNLS 497 
LRRNT LRR16 xLxxLxLxxN 

Human  TLR9 LGTLPAFLPCELQPHGLVNCNWLBLASVPHFSMAAPRG 63 Human _ TLR9 HLOCLRLSHNCISQAVNGSOFLPLT 521 
Horse_ TLR9 QGTLPPFLPCELQPHGLVNCNWLBLASVPHFSAAAPRD 63 Horse TLR9 RLOCLRLSHNSISQAVNGSOFVPLT 521 
Bovine TLR9 EGTLPAFLPCELQPHGQVDCNWLBLASVPHFSAGAPRA 62 Bovine TLR9 RLOCLRLSHNSISQAVNGSQFVPLT 519 
Mouse — TLR9 LGTLPAFLPCELKPHGLVDCNWLELASVPRFSAAASCS 63 Mouse  TLR9 RLOCLSLSHNS TAOAVNGSQFLPLT 522 
LRR1 xLxxLxLxxN LRR17 xLxxLxLxxN 

Human _ TLR9 NVTSLSLS ILHDSDFAHLP 87 Human_ TLRQ GLOVLDLSHNKLDLYHEHSFTELP 545 
Horse TLR9 NVTSLSLL ILHDSDFAQLS 87 Horse TLR9 SLOVLDLSHNKLDLYHGRSFTELP 545 
Bovine TLR9 NVTSLSLI ILHDSDFVHLS 86 Bovine TLR9 SLRVLDLSHNKLDLYHGRSFTELP 543 
Mouse  TLR9 NITRLSLI ILHNSDFVHLS 87 Mouse _ TLR9 NLOVLDLSHNKLDLYHWKSFSELP 546 


LRR2 xLxxLxLxxN LRR18 xLxxLxLxxN 
Human_ TLR9 = SLRHLNURWNCBPVGLSIBMHEIBCHMTIEPSTFLAVP 23 Human_ TLR9 = RLEALDLSYNSQPFGMQGVGHNFSFVAHLR 575 
Horse TLR9 = NLOKLNURWNCBPAGLSPMHBPCHMTIEPNTFLAVP 23 Horse TLR9 = RLEALDLSYNSQPFSMRGVGHNLSFVAQLP 575 
Bovine TLR9 = NLRVLNURWNCBPAGLSBMHBBCRMTIEPNTFLAVP 22 Boviné TLR9 § QLEALDLSYNSQPFSMQGVGHNLSFVAQLP 573 
Mouse TTLR9 — NLRQLNTIGMINCEPTGLBBEHBSCHMTTEPRTFLAMR 23 Mouse _“TLR9  QLOALDLSYNSQPFSMKGIGHNFSEVTHLS 576 
LRR3 xLxxLxLxxN LRR19 xLxxLxLxxN 
Human_ TLR9 = TLEELNLS(YNNIMTVPALPK 43 Human_ TLR9 § TLRHLSLAHNNIHSQVSQOLCST 598 
Horse TLR9 = TLEELNLS®NGITTVPALPS 143 Horse_ TLR9  TLRYLSLAHNGIHSRVSQOLCST 598 
Boviné TLR9 = TLEELNLS|NGITTVPALPS 142 Boviné TLR9 | SLRYLSLAHNGIHSRVSOQKLSSA 596 
Mouse ~TLR9  TLEELNLSWNGITTVPRLPS Mouse ~“TLR9 = MLOSLSLAHNDIHTRVSSHLNSN 599 
LRR4 xLxxLxLxxN LRR20O xLxxLxLxxN 
Human_ TLR9 = SLISLS[S|AI7NILMLDSASLAGLH 167 Human_ TLR9 = SLRALDFSGNALGHMWABGDLYLHFFQGLS 628 
Horse_ TLR9 = SLVSI|ILSIBI(NILOLDPTSLTGLH 67 Horse_ TLR9 = SLWALDFSGNSLSQMWABGDLYLRFFQGLR 628 
Boviné TLR9  SLVSLS[LSIEI'SILVLGPTHFTGLH 66 Boviné_TLR9 | SLRALDFSGNSLSQMWABGDLYLCFFKGLR 626 
Mouse ~TLR9 — SLVNLUSES[If'NILVLDANSLAGLY 67 Mouse “TLR9 — SVRFLDFSGNGMGRMWDEGGLYLHFFOGLS 629 
LRRS xLxxLxLxxN LRR21 xLxXxXLXLxxN 
Human_ TLR9 — ALRFIEMDENC PCRQALEVAPGALLGLG 99 Human_ TLR9 §GLIWLDLSQNRLHTBLPQTLRNLPK 653 
Horse_ TLR9 — ALREI|yMDBNC PCGRALEVAPGALLGLG 99 Horse” TLR9 = SLIRLDLSQNRLHTELPCTLGNLPK 653 
Boviné TLR9 = ALRFI\yMDBNC PCPRALEVAPGALLGLG 98 Boviné_TLR9 | NLVQLDLSENHLATLLPRHLDNLPK 651 
Mouse “TLR9 — SLRVI[EMD]sNCHMNPCTGAVKVTPGALLGLS 99 Mouse _TLR9  GLLKLDLSQNNLHILRPQNLDNLPK 654 
LRR6 xLxxLXLxxN LRR22 xLxxLxLxxN 
Human_ TLR9  NLTHI| NLTVVPRNLPS 220 Human_ TLR9 = SLOVLRLRDNYLABFKWWSLHFLP 677 
Horse” TLR9 = NLTHI NLTTVPRSLPP 220 Horse_ TLR9 §SLOLLRLRNNYLABFNWSSLTLLP 677 
Boviné TLR9  NLTHI NLTEVPRRLPP 219 Boviné TLR9 | SLRQLRLRDNNLABFNWSSLTVLP 675 
Mouse ~TLR9  NLTHI| NLTKVPRQLPP 220 Mouse ~TLR9 | SLKLLSLRDNYLSEFNWTSLSFLP 678 
LRR7 xLxxLxLxxN LRR23 xLxxLxLxxN 
Human_ TLR9 = SLEYJ RIVKLAPEDLANLT 244 Human_ TLR9 = KLEVLDLAGNQLKALTNGSLPAGT 701 
Horse” TLR9 = SLEYI HIVTLAPEDLANLT 244 Horse” TLR9 = NLETLDLAGNQLKALSNGSLPSGT 701 
Bovine TLR9  SLDTILLS¥NHIVTLAPEDLANLT 243 Boviné TLR9 |= RLEALDLAGNQLKALSNGSLPPGI 699 
Mouse _~TLR9 = SLEYI LIVKLGPEDLANLT 244 Mouse_-TLR9 = NLEVLDLAGNQLKALTNGTLPNGT 702 
LRR8 xLxxLXLxxN LRR24 xLxXxLxLxxN 
Human_ TLR9 = AL GCN IPCMECPRHFPQLHPDTFSHLS 284 Human_ TLR9 = RLRRLDVSCNSISFVAPGFFSKAK 725 
Horse” TLR = AL GGN' IPCVECPHKFPQLHSDTFSHLS 284 Horse” TLR9 = QLORLDVSRNSIIFVVPGFFALAT 7125 
Boviné TLR9 AL GGN' IPCRECPKNFPKLHPDTFSHLS 283 Boviné_TLR9 | RLOKLDVSSNSIGEVIPGFFVRAT 7123 
Mouse ~TLR9 = SL GGN IPCIECGOKSLHLHPETFHHLS 284 Mouse “TLR9 © LLQKLDVSSNSIVSVVPAFFALAV 726 
a ise 
LRR9 xLxxLxLxxN LRR25 xLxxLxLxxN 
Human_ TLR9 = RLE| DSSLSWLNASWERGLG 308 Human_ TLR9 = ELRELNLSANALKTVDHSWFGPLAS 750 
Horse” TLR9 — RLEGEIVIAPSSLYQLNPRWFRGLG 308 Horse_ TLR9 = RLRELNLSANALRTVEPSWFGFLAG 750 
Bovine TLR9 — RLEGEVRDSSLYKLEKDWFRGLG 307 Boviné TLR9 | RLIELNLSANALKTVDPSWFGSLAG 748 
Mouse ~TLR9 — HLELIIVIIgOSSLHTLNSSWFQGLV 308 Mouse “TLR9 — ELKEVNLSHNILKTVDRSWFGPIVM 751 
LRR10 xLxxLxLxxN LRR26 xLxxLxLxxN 
Human_ TLR9 = NL FLYKCITKTKAFQGLT 334 Human_ TLR9  ALQILDVSANPLH 7163 
Horse” TLR9 = NL FLYDCITKTKAFOGLA 334 Horse” TLR9 § SLEVLDVSANPLH 7163 
Boviné TLR9 = RL FLYDYITKTTIFNDLT 333 Boviné_TLR9 TLKILDVSANPLH 761 
Mouse —TLR9 NLS FLYESITHTNAFONLT 334 Mouse —TLR9 = NLTVLDVRSNPLH 764 
LRR11 xLxxLxLxxN LRRCT 
Human_ TLR9 YQKRVSFAHLSLAPSFGSLV 364 Human_ TLR9 = CACGAAFMDFLLEVQAAVPGLPSRVKCGSPGQLOQGLSIFAQDLRLCLDEALSWDC 818 
Horse TLR9 YHKKVSFAHLTLAPSFGSLL 364 Horse TLR9 = CACGAAFVDFLLOVQAAVPGLPSRVKCGSPGQLOGRSIFAQDLRLCLDESLSWDC 818 
Bovine TLR9 YHKKVSFAHLHLASSFGSLV 363 Boviné_TLR9 | CACGAAFVDFLLERQEAVPGLSRRVTCGSPGOLOGRSIFTQDLRLCLDETLSLDC 816 
Mouse _~TLR9 ENYRKKVSFARLHLASSFKNLV 364 Mouse “TLR9 — CACGAAFVDLLLEVOTKVPGLANGVKCGSPGOLQGRSIFAQDLRLCLDEVLSWDC 819 
LRR12 xLxxLxLxxN 
Human_ TLR9 — ALKELDMljs TF FRSLDETTLRPLARLP 391 
Horse TLR9 SLOE 48 IFFRSLSQKTLOPLARLP 391 Agonist DNA interfacel 
Bovine TLR9 — SLEKIDMHGIFFRSLTNITLOSLIRLP 390 
Mouse ~TLR9 — SLOEIWNMNGIFFRLLNKYTLRWLADLP 391 
Per Gee oee Agonist DNA interface2 
Human_ TLR9 = MLOTLRLOMNFINQAQLGIFRAFP 415 : ; 
Horse” TLR9 = MLORLYLOMNFINQAQLGIFKDEP 415 [Antagonist DNA interface] 
Bovine TLR9 § KLOSLHLOLNFINQAQLSIFGAFP 414 
Mouse —TLR9 © KLHTLHLQMNE INQAQLSIFGTER 415 Protein-protein interface ( = ) 
LRR14 xLxxLxLxxN 
Human_ TLR9 § GLRYVDLSDNRISGASEL-T 434 
Horse” TLR9 GLRYIDLSDNRISGAVEP-V 434 
Boviné TLR9 | SLLFVDLSDNRISGAATP-A 433 
Mouse “TLR9 — ALRFVDLSDNRISGPSTLSE 435 
Z-loop 
Human TLR9 = ATMGEADGGEKVWLQPGDLAPAPVDTPSSEDFRPNCST 472 
Horse” TLR9 = ATTGEVDGGKKVWLTSRDLTPGPLDTPSSEDEFMPSCKN 472 
Boviné TLR9 |= AALGEVDSRVEVWRLPRGLAPGPLDAVSSKDEMPSCN- 470 
Mouse TLR9 | ATPEEADDAEQEELLSADPHPAPLSTPASKNEMDRCKN 473 
Extended Data Figure 1 | Sequence alignment of human, horse, bovine the EcTLR9-DNA1668_12mer complex is indicated by bold orange bars 
and mouse TLR9. Sequence alignments are displayed for each LRR module. _ below each LRR module. Alignments were performed using Clustal Omega 
The agonist DNA interfaces 1 and 2 deduced from the EcTLR9- software (EMBL-European Bioinformatics Institute). Residues are coloured to 
DNA1668_12mer complex are indicated by blue and yellow highlighting, indicate the degree of similarity: red residues are those with the highest 
respectively. The antagonist DNA interface deduced from the EcTLR9- similarity, followed by green, blue and black (lowest similarity). 


iDNA4084 complex is indicated with boxes. The protein-protein interface in 
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Extended Data Figure 2 | The NF-«B activation experiments. 

a, DNA1668_12mer retains agonistic activity to TLR9. The NF-KB 

activation of wild-type mouse TLR9 induced by DNA1668 
(TCCATGACGTTCCTGATGCT), DNA1668_12mer (CATGACGTTCCT), 
DNA1668 or DNA1668_12mer with a CpG to GpC inversion (DNA1668_GC, 
DNA1668_12mer_GC). DNAs were all complexed with N-[1-(2,3- 
Dioleoyloxy)propyl]-N, N, N-trimethylammonium methyl-sulphate (DOTAP) 
and added at concentration of 1 1M. The activities were analysed with an NF- 
«B-dependent luciferase reporter assay using HEK293T cells co-expressing 
mouse TLR9 and mouse Unc93B1. A two-tailed t-test was used to determine 
the statistical significance of differences between control (Ctrl) and stimulated 
cells, or between each group. **P < 0.01. Data from three independent 
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experiments are shown. b, Horse and bovine TLR9 responses against agonistic 
and inhibitory DNAs. The NF-«B activation of wild-type horse (left) or 
bovine (right) TLR9 induced by indicated DNAs. The activities were analysed 
with an NF-«B-dependent luciferase reporter assay using HEK293T cells 
co-expressing horse or bovine TLR9 and human Unc93B1. The concentration 
of agonistic DNAs (DNA1668, DNA1668_GC and DNA1668_met) and 
inhibitory DNAs (iDNA4084 and iDNA_super) were 10 1M and 1 1M, 
respectively. Data represent the mean fold induction of NF-«B activity +s.d. 
(n = 3). A two-tailed t-test was used to determine the statistical significance of 
differences between control (Ctrl) and stimulated cells, or between each 
group. **P< 0.01. Data from three independent experiments are shown. 
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Extended Data Figure 3 | Dimerization interaction of TLR9. a, b, The coefficients (S,,) were plotted against TLR9-DNA1668_12mer concentration 
oligomerization states of ECTLR9 with an unprocessed Z-loop (a) or a to determine the Kg value for the dimerization. The dissociation constant for 


processed Z-loop (b) were analysed by SV-AUC at various concentrations of _ the dimerization of the processed TLR9 is estimated to be 20 1M. 
TLR9-DNA1668_12mer (equimolar). The weight-average sedimentation 
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BtTLR9/DNA1668_12mer 


DNA1668_12 mer 
5'-CATGACGTTCCT-3' oO? 


MmTLR9/iDNA_super 


iDNA4084 5’-CCTGGATGGGAA-3’ iDNA4084 5’-CCTGGATGGGAA-3’ iDNA_super 5’--CCTCAATAGGGTGAGGGG-3’ 


Extended Data Figure 4 | Electron densities of DNA bound to TLR9. 

a-e, The F,-F. omit difference electron densities of DNA1668_12mer bound to 
EcTLR9 (a) and BtTLR9 (b), iDNA4084 bound to EcTLR9 (c) and MmTLR9 
(d), and iDNA_super bound to MmTLR9 (e) contoured at the 3.0a level. 
The residues coloured blue in the sequence are not visible in the electron density 
map. The core hexamer could be unambiguously modelled into the continuous 


electron density map in the ECTLR9-DNA1668_12mer and BtTLR9- 
DNA1668_12mer complexes, whereas flanking bases were obscure or not 
visible. The A5-T12 loop connecting the base-paired region of iDNA_super 
was not visible in the electron density map, whereas the G4—A6 sequence of 
iDNA4084 was well defined. 
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Extended Data Figure 5 | Structures of TLR9. a, Monomer structure of 
EcTLR9, derived from the EcTLR9-DNA1668_12mer complex. The structure 
and binding mode of TLR9 are markedly different from those of other CpG- 
binding proteins”*'*’. b, Monomer structure of human TLR8, derived from 
the human TLR8-CL097 (2-(ethoxymethyl)-1H-imidazo[4,5-c]quinolin-4- 
amine) complex (PDB ID: 3W3J)’*. The Z-loops in TLR9 and TLR8 are 
oriented differently with respect to the concave face of TLR and engage in 
different interactions with it. The latter half of the Z-loop of TLR8 extends 
towards the N-terminus, whereas the Z-loop of TLR9 extends towards the 
C-terminus to interact with LRR15-21. TLR8 has three ordered N-glycans 
attached to Asn293, Asn511 and Asn590 that project into the inner space of the 
ring structure, whereas EcTLR9 has only one N-glycan attached to Asn731 
that projects inward. As a result, the ring structure of TLR9 has more 
unoccupied inner space than that of TLR8, an arrangement that is suitable for 
ligand binding on the concave interior surface. c, Superposition of the overall 
ligand-induced dimer structures of ECTLR9 (DNA1668_12mer complex, 
green) and human TLR8 (CL097 complex, purple) by PyMol”, yielding an 
root-mean-square deviation (r.m.s.d.) value of 2.3 A. d, Superposition of the 
overall dimer structures of EcCTLR9 (DNA1668_12mer complex, green) and 
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BtTLR9 (DNA1668_12mer complex, purple) by PyMol”, yielding an r.m.s.d. 
value of 0.7 A. e, Magnified view of the CpG-binding groove of ECTLR9 and 
BtTLR9. The amino acid at position 109 was proline for human and serine 
for mouse (rodents). From the structure, Prol09 made a van der Waals contact 
with A at the -1 position, but its contact was somewhat close. If A is changed to 
T, this contact is weakened. Serine at position 109 would accommodate a 
larger base. f, Magnified view of the G10 of iDNA4084 recognition by TLR9 
(EcTLR9-iDNA4084 complex). G10 of iDNA4084 makes three hydrogen 
bonds with TLR9: N2, N1 and 06 atoms of G10 with the side chains of Ser205, 
Asp175 and Ser151, respectively. To examine the functional importance of this 
base, we substituted it with other bases and examined the binding affinity 

by ITC. The affinity of TLR9 for G10A (Ka = 6 nM) was reduced from that of 
the original sequence (Kg = 3nM), but TLR9 exhibited much lower affinity 
for DNAs with a pyrimidine at this position (Ka = 41nM for G10C and 

Ka =76nM for G10T), suggesting that this position favours purine over 
pyrimidine. g, Superposition of iDNA4084 bound to EcTLR9 (green) and 
MmTLR9 (purple). The binding mode of iDNA4048 is perfectly conserved 
between EcTLR9 and MmTLR9. 
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EcTLR9/iIDNA4084 
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EcTLR9/DNA1668_12mer (pH 6.0) 


Extended Data Figure 6 | Binding interfaces for agonistic and inhibitory 
DNA. a, Superposition of the structures of unliganded (grey), 
DNA1668_12mer-bound (green), and iDNA4084-bound (blue), EcTLR9. 
TLR9 and DNA are shown in Cw-trace and stick representation, respectively. 
No significant conformational change was observed upon agonistic DNA 
binding, as indicated by the small r.m.s.d. value of 0.8 A between EcTLR9 
(unliganded) and the EcCTLR9-DNA1668_12mer complex. Instead, the 
conversion of EcTLR9 into the activated form appears to involve local 
conformational changes in the loop regions of LRR8, LRR11 and LRR18, all of 
which are involved in formation of the dimer. No significant structural change 
was induced by binding to iDNA4084, as indicated by the small r.m.s.d. values 
of 0.49 A and 0.45 A between the unliganded and iDNA4084-bound forms of 
EcTLR9 and MmTLR3, respectively. b, Surface representations of EcCTLR9 
structures in the DNA1668_12mer (upper) and iDNA4084 complexes (lower). 
The protein-protein interface, TLR9_DNA1668_12mer interfaces 1 and 2, and 
TLR9_iDNA4084 interface are shown in orange, blue, cyan and yellow, 
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respectively. The bound DNAs are shown in stick representation with their 5’ 
and 3’ ends indicated. DNA1668_12mer buries approximately 1,136 A* and 
294 A? of the accessible surface area of TLR9 and TLR9*, respectively, 
suggesting that the N-terminal binding site of TLR9 for DNA1668_12mer 
makes a relatively larger contribution to binding. The binding site for iDNA 
partially overlaps with the binding site for agonistic DNA. Specifically, LRR4 
and LRRS5 are both involved in the binding sites for agonistic DNA and iDNAs, 
although the binding modes of DNA1668_12mer and iDNA4084 are 
completely different: Arg152 (LRR4), Tyr179 and Lys181 (LRR5) interact with 
the phosphate of C11 of DNA1668_12mer but also with G8 of iDNA4084. 

c, Electrostatic potential map of DNA-binding region. The map was calculated 
at basic and acidic conditions by PyMol’*. Surface colours represent the 
potential from —20 kgT/e (red) to 20 kgT/e (blue), where kg is the Boltzman 
constant and T is the absolute temperature. The DNA molecule is shown as a 
stick model. 
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Extended Data Figure 7 | ITC thermograms for ITC data (related to Extended Data Table 2). Representative ITC thermograms for the ITC data are shown 
with their pH condition and Kg values for EcTLR9. 
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Extended Data Figure 8 | Significance of the consensus sequence in the 
CpG-DNA. Gel-filtration chromatography of EcTLR9 with FITC-labelled 
DNA. DNA binding to TLR9 was monitored by FITC-fluorescence (excitation 
495 nm, emission 520 nm). The parenthesized values indicate the ratios of 
the fluorescence peak height of the derivative DNA to the original DNA 
containing the consensus sequence of GACGTT (top left). In each experiment, 
0.5 uM EcTLR9 (total volume of 45 il) with DNA (equimolar) was injected into 
a Superdex 200 Increase 5/150 GL (GE healthcare) gel-filtration column. 

The running buffer was 10 mM 2-morpholinoethanesulfonic acid and 250 mM 
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NaCl at pH 5.5. DNAs used in the analyses are shown in each panel. Bases that 
are changed from the original sequence are highlighted in red. Conversion of 
the purine-purine sequence (GA) at the -1 and -2 positions of the CpG 
motif (underlined) to AA and GG resulted in DNAs with affinities similar to the 
wild-type DNA, but conversion to a pyrimidine—pyrimidine sequence (TT and 
CC) weakened the affinity, demonstrating that a purine-purine sequence is 
preferable at these positions. Conversion of the base at the + 1 position into C, A 
or G led to weaker binding, suggesting that T is preferable at the +1 position, 
although T is not specifically recognized. 


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Table 1 | Crystallization, data collection and refinement statistics 


Crystallization 
Protein solution 


Reservoir solution 


Cryoprotectant solution. 


Data collection 
Space group 
Cell dimensions 
a, b,e (A) 
@ Boy () 
Resolution (A) 


Rosa 

IisI 
Completeness (%) 
Redundancy 


Refinement 
Resolution (A) 
No. reflections 
Revere! Réce 
No. atoms 
Protein 
DNA 
Water 
B-factors 
Protein 
DNA 
Water 
R.m:s deviations 
Bond lengths (A) 
Bond angles (°) 


EcTLR9 
(Unliganded) 


4.0 mg/ml EcTLR9 
10 mM Tris pH 7.5 
0.15 M NaCl 


10-14% (w/v) PEG3350 
0.2 MAS 

0.1 M Na acetate pH 4.5 
10% (v/v) ethylene 
glycol 

15% (wiv) PEG3350 
0.2MAS 

0.1 M Na acetate pH 4.5 
20% (v/v) ethylene 
glycol 


c2 


98.1, 118.8, 77.8 
90, 108.7, 90 
36.8-2.40 
(2.53-2.40)* 
0.162 (0.611) 
6.12.1) 

100.0 (100.0) 

3.6 (3.6) 


36.9-2.4 
31,314 
19.7/25.0 


5,909 


EcTLR9 
(DNA1668_12mer) 


4.3 mg/ml EcTLR9 

10 mM Tris pH 7.5 

0.35 M NaCl 

140 uM DNA1668_12mer 
1/10 vol. of Silver Bullet AGF 
1% (w/v) PEG 4000 

1% (v/v) 2-propanol 

0.2 M Na acetate 

0.1 M Na citrate pH5.8 

5% (v/v) DMSO 

5% (wiv) PEG 4000 

5% (v/v) 2-propanol 

0.15 M Na acetate 

0.25 M NaCl 

0.1 MNa citrate pH5.8 

10% (v/v) DMSO 

20% (v/v) glycerol 

1/10 vol. of Silver Bullet A6 


Pl 


68.8, 81.6, 81.5 
87.5, 83.5, 86.0 
50.0-1.60 
(1.66-1,60) 
0.078 (0.537) 
9.0 (L.8) 

95.7 (84.4) 

3.8 (2.8) 


47.5-1.60 
210,033 
18.4/22.0 


12,022 
478 
1,595 


ww tw 
vos 
aww 


= 
eS 
ae 
aon 


EcTLR9 
(iDNA4084) 


5.8 mg/ml EcTLR9 
10 mM Tris pH 7.5 

0.35 M NaCl 
130 1M iDNA4084 


10-14% (w/v) PEG 3350 
0.2 MAS 

0.1 M Na acetate pH 4.5 
10% (v/v) ethylene glycol 


15% (w/v) PEG 3350 
0.2MAS 

0.1 M Na acetate pH 4.5 
20% (v/v) ethylene glycol 


C2 


117.6, 117.1, 69.6 
105.5 

67.1-2.75 
(2.90-2.75) 
0.143 (0.530) 
7.22.3) 

99.9 (99.9) 
3.53.2) 


EcTLR9 
(DNA1668_12mer) 


6.8 mg/ml BtTLR9 

10 mM Tris pH 7.5 

0.35 M NaCl 

150 uM DNA1668_12mer 


3-5% (w/v) PEG 4000 
0.2 M Na acetate 

0.1 M Na citrate pH 5.5 
15% (v/v) DMSO 


8% (w/v) PEG 4000 
0.2 M Na acetate 

0.1 M Na citrate pH 5.5 
15% (v/v) DMSO 

25% (v/v) glycerol 


CQ 


115.8, 113.1, 67.4 
100.6 

481-238 
(2.51-2.38) 
0.113 (0.598) 
7.7.1) 

99.8 (99.3) 

3.6 (3.5) 


44.6-2.38 
32,294 
21.7/274 
5,628 

124 

31 


50.5 


BtTLR9 
(Unliganded) 


5.1 mg/ml MmTLR9 
10 mM Tris pH 7.5 
0.15 M NaCl 


6-8% (w/v) PEG 8000 
0.4MAS 

0.1 MTris pH 8.0 
10% (v/v) DMSO 


10% (w/v) PEG 8000 
04MAS 

0.1 M NaCl 

0.1 MTris pH 8.0 
10% (v/v) DMSO 
15% (v/v) glycerol 


P2212) 
71.3, 119.3, 130.4 


48.11-1.96 
(2.06-1.96) 
0.113 (0.486) 
7.9 (2.8) 
98.6 (98.4) 
433.8) 


44.6-1.96 


MmTLR9 
(G@DNA4084) 
Forml 


5.1 mg/ml 
MmTLR9 

10 mM Tris pH 7.5 
0.15 M NaCl 

110 uM iDNA4084 
6-8% (w/v) PEG 
8000 

0.4M AS 

0.1 M Tris pH 8.0 
10% (v/v) DMSO 
10% (w/v) PEG 
8000 

0.4M AS 

0.1 M NaCl 

0.1 M Tris pH 8.0 
10% (v/v) DMSO 
15% (v/v) glycerol 


0.120 (0.836) 
9.6 (1.9) 
99.9 (99.8) 
6.9 (5.8) 


MmTLR9 
(iDNA4084) 
Form2 


5.1 mg/ml MmTLR9 
10 mM Tris pH 7.5 
0.15 M NaCl 

110 tM iDNA4084 


6-8% (w/v) PEG 
8000 

04M AS 

0.1 M Tris pH 8.0 
10% (v/v) DMSO 
10% (w/v) PEG 8000 
04M AS 

0.1 MNaCl 

0.1 M Tris pH 8.0 
10% (v/v) DMSO 
15% (v/v) glycerol 


0.084 (0.453) 
8.5(2.1) 
99.7 (99.3) 
3.5 (3.1) 


38.9-2.33 
53,154 
20.8 / 24.8 


5,728 
248 
150 


49.1 
37.6 
42.2 


0.005 
1.00 
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MmTLR9 
(iDNA_super) 


5.6 mg/ml MmTLR9 
10 mM Tris pH 7.5 
0.35 M NaCl 

120 uM iDNA_ super 


6-8% (w/v) PEG 8000 
0.4MAS 

0.1 M Tris pH 8.0 
10% (v/v) DMSO 


10% (w/v) PEG 8000 
04M AS 

0.1 MNaCl 

0.1 MTris pH 8.0 
10% (v/v) DMSO 
15% (v/v) glycerol 


c2 


137.3, 129.7, 71.4 
92.2 

57.6-2.25 
(2.37-2.25) 
0.072 (0.449) 
10.2 (2.2) 

97.2 (94.0) 

3.7 (3.3) 


Each data set was collected with one crystal. 
* Highest resolution shell is shown in parenthesis. 
+ Silver Bullet is a crystallization screen kit for biological macromolecules (Hampton Research Inc.). Silver Bullet A6 contains 0.33% (w/v) 2,6-naphthalenedisulfonic acid disodium salt, 0.33% (w/v) 2,7- 
naphthalenedisulfonic acid disodium salt, 0.33% (w/v) anthraquinone-2,6-disulfonic acid disodium salt and 0.02 M Na-HEPES pH 6.8. 
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Extended Data Table 2 | ITC results 


. > K. AH AS No. of 
Protein pe ONS Sequence 5’—3’ (nn) (kcalimol) _(cal/mol/deg) dypeniteant 

Horse TLR9_WT 6.0 DNA1668_12mer CATGACGTTCCT 2045 -18.841.5 -26.94+5.9 0.5 4 
6.0 DNA1668_12mer_met CATGAC(met)GTTCCT 50 +12 -11.9406 65421 0.6 5 

6.0 DNA1668_12mer_GC CATGAGCTTCCT 570+ 47 -7.7+0.8 2842.7 0.6 3 

6.0 DNA1668_12mer_TG CATGATGTTCCT 160 + 34 124405 10.5422 0.7 3 

6.0 DNA1668_12mer_CA CATGACATTCCT 880 + 160 -41+0.6 14.1423 0.7 3 

6.0 DNA1668_12mer_UG CATGAUGTTCCT 54410 -17.442.11 -24.947.4 0.7 4 

6.0 iDNA4084 CCTGGATGGGAA 3+1 9640.2 6.7 41.1 0.5 3 

6.0 iDNA4084_G10A CCTGGATGGAAA 6+1 -25.14+0.2 46.4411 0.5 3 

6.0 iDNA4084_G10C CCTGGATGGCAA 41412 -27.441.2 -57.9 44.1 0.4 3 

6.0 iDNA4084_G10T CCTGGATGGTAA 7645 -21.740.3 -40.141.1 0.4 3 

6.0 iDNA4084_C2GG8C CGTGGATCGGAA 750 -2.8 18.7 0.5 1 

6.0 dsDNA1668_12mer ee oc 1800 13.4 71.3 0.5 1 

6.0 dsiDNA4084 sera cloaca 690 24.7 111.0 0.5 1 

Horse TLRQ_WT 6.4 DNA1668_12mer CATGACGTTCCT 120 -22.0 41.9 0.6 1 
6.8 DNA1668_12mer CATGACGTTCCT 240 -29,1 -67.3 0.5 1 

7.2 DNA1668_12mer CATGACGTTCCT 550 -36.5 -93.9 0.4 1 

7.6 DNA1668_12mer CATGACGTTCCT 1100 -30.2 -74.0 0.5 1 

8.0 DNA1668_12mer CATGACGTTCCT 2500 -26.0 -61.5 0.5? 1 

Horse TLR9_W47A 6.0 DNA1668_12mer CATGACGTTCCT 680 + 160 -11642.2 -10.5+47.5 1.2 3 
6.0 iDNA4084 CCTGGATGGGAA 5+3 -11.440.2 -0.141.9 0.4 3 

Horse TLR9_W96A 6.0 DNA1668_12mer CATGACGTTCCT 1300 + 660 -11.4429 -10.9 411.2 0.9 3 
6.0 iDNA4084 CCTGGATGGGAA 6+3 12.7424 4846.8 0.4 3 

Horse TLR9_F108A 6.0 DNA1668_12mer CATGACGTTCCT 800 + 300 8143.4 0.7 412.2 1.0 3 
6.0 iDNA4084 CCTGGATGGGAA 1046 -10.542.0 1.7468 0.4 3 
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The octahaem MccA is a haem c-copper sulfite 


reductase 


Bianca Hermann!, Melanie Kern’, Luigi La Pietra’, J org Simon? & Oliver Einsle’? 


The six-electron reduction of sulfite to sulfide is the pivot point of 
the biogeochemical cycle of the element sulfur’. The octahaem cyto- 
chrome c MccA (also known as SirA) catalyses this reaction for dis- 
similatory sulfite utilization by various bacteria. It is distinct from 
known sulfite reductases because it has a substantially higher cata- 
lytic activity and a relatively low reactivity towards nitrite. The mech- 
anistic reasons for the increased efficiency of MccA remain to be 
elucidated. Here we show that anoxically purified MccA exhibited 
a 2- to 5.5-fold higher specific sulfite reductase activity than the 
enzyme isolated under oxic conditions”. We determined the three- 
dimensional structure of MccA to 2.2 A resolution by single-wavelength 
anomalous dispersion. We find a homotrimer with an unprecedented 
fold and haem arrangement, as well as a haem bound to a CX,;CH 
motif*. The heterobimetallic active-site haem 2 has a Cu(I) ion jux- 
taposed to a haem cat a Fe-Cu distance of 4.4 A. While the combi- 
nation of metals is reminiscent of respiratory haem-copper oxidases, 
the oxidation-labile Cu(I) centre of MccA did not seem to undergo 
a redox transition during catalysis. Intact MccA tightly bound SO, 
at haem 2, a dehydration product of the substrate sulfite that was 
partially turned over due to photoreduction by X-ray irradiation, 
yielding the reaction intermediate SO. Our data show the biometal 
copper in a new context and function and provide a chemical ratio- 
nale for the comparatively high catalytic activity of MccA. 
Sulfate-reducing microbes couple the reduction of sulfate, SO,’ , and 
sulfite, SO,” , to the generation of a proton motive force to sustain or- 
ganismic growth. A single enzymatic step is required for the six-electron 


Figure 1 | Structure of W. succinogenesMccA. a, Stereo representation of the 
MccA monomer coloured from blue (N terminus) to red (C terminus), 
highlighting the prominent trimerization helices. b, Surface representation 

of the functional trimer of MccA. The trimerization helices are at the bottom 
and the substrate access funnel in the globular domain is well visible. c, Top 
view of the trimer, coloured as in b. d, Arrangement of the 24 iron atoms within 
the MccA trimer. The haem numbering in the left monomer follows the 


reduction of sulfite—present at physiological pH (pK, = 6.97) in equi- 
librium with the monovalent hydrogen sulfite anion, HSO; —to sul- 
fide, either using the tetrapyrrol cofactor sirohaem in assimilatory or 
dissimilatory sulfite reductases’, or a haem c as a side reactivity of the 
pentahaem cytochrome c nitrite reductase NrfA**. Reported specific 
sulfite reduction rates for these enzymes are low, with turnover number 
(keat) Values ranging from 0.2 to 2 s_' (refs 4, 6, 7). All known sulfite 
reductases also catalyse the six-electron reduction of nitrite, NO, to 
ammonium, NH,', and in the case of NrfA, a single-point variant at 
the active site loses nitrite reductase activity, while leaving its basal sul- 
fite reductase activity unchanged*. The genome of the Epsilonproteo- 
bacterium Wolinella succinogenes does not encode a sirohaem sulfite 
reductase and the nrfA gene is not induced during sulfite respiration’. 
Instead, sulfite is reduced by the octahaem c-type cytochrome MccA, 
with sulfide as the sole product”®. The reported activity of MccA ex- 
ceeded the activity levels reported for other sulfite-reducing enzymes 
by far, but the mechanistic basis for this higher efficiency remained 
unclear'*. MccA belongs to the genetically diverse family of multihaem 
c enzymes and has eight haem groups covalently attached to conserved 
haem-binding motifs in the peptide sequence”*”. Multihaem cytochrome 
c enzymes show a high conservation of haem group arrangements, but 
not of sequence, with recurring haem-packing motifs that result in either 
a parallel or a perpendicular packing of two of the moieties'*"'. They 
catalyse complex multi-electron redox reactions and bind their substrates 
through the free electron pairs of a heteroatom to a free coordination 


Active 
site 


12.3A 


sequence of attachment to the protein chain. The active sites are located at 
haem 2 of each subunit. Fe-Fe distances are given for one monomer, and 

the closest Fe-Fe distance between monomers is between haems 1 and 4’ 

at 17 A. e, Cofactor arrangement. The 24 haem groups form a complex pattern 
of interacting parallel and perpendicular packing motifs. f, A top view of 

the haem groups emphasizes the trimeric cofactor arrangement. 


1Lehrstuhl Biochemie, Institut fur Biochemie, Albert-Ludwigs-Universitat Freiburg, Albertstrasse 21, 79104 Freiburg, Germany. Microbial Energy Conversion & Biotechnology, Department of Biology, 
Technische Universitat Darmstadt, Schnittspahnstrasse 10, 64287 Darmstadt, Germany. °BIOSS Centre for Biological Signalling Studies, Schanzlestrasse 1, 79104 Freiburg, Germany. 
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Figure 2 | Haem-copper active sites in respiratory enzymes. 

a, W. succinogenes MccA binds its substrate sulfite in the dehydrated form, SO, 
at the distal axial position ofhaem 2. At 3.2 A distance from the sulphur atom, a 
Cu(1) ion is nearly linearly coordinated by residues C399 and C495. b, In 
respiratory haem-copper oxidases, Cug is a redox-active species liganded by 
three histidine residues and juxtaposed to a haem a3 moiety. The arrangement, 
with a Fe-Cu distance of 4.9 A, is optimized to bind O, and peroxide in a 
bridging fashion (Protein Data Bank accession 3ABM). 


position at an active-site haem group. Electrons are then provided or 
accepted by the tightly coupled chain of haem groups”. To understand 
the functional specifications that enable the increased sulfite reductase 
activity of MccA, we isolated the enzyme from W. succinogenes and de- 
termined its three-dimensional structure to 2.2 A resolution (Extended 
Data Table 1). The active site of MccA contained an additional metal 
ion that was depleted during oxic isolation procedures. This ion was 
identified as Cu(I), making MccA a new type of heterobimetallic oxi- 
doreductase, reminiscent of the membrane-integral respiratory haem- 
copper oxidases. 

Physiological sulfite respiration occurs under anaerobic conditions, 
but Strep-tagged MccA from W. succinogenes was previously isolated 
only in the presence of dioxygen**. Here, we produced MccA both under 
strict exclusion of dioxygen (form I) and under oxic conditions (form II) 
for comparison. First, crystals were obtained for form II at <5 p.p.m. 
O,, and whereas the anoxically prepared form I crystallized as it was 
isolated, the crystallization of MccA II required reduction with dithio- 
nite or the addition of sulfite. The protein was modelled from amino 
acids M31 to S689, with disorder only in the termini (Fig. 1a). Aside 
from the relevant changes at the active site (discussed later), no major 
differences were observed between the individual structures. The bio- 
logical assembly of MccA is a trimer with approximate dimensions of 
90 X 90 x 90 A? (Fig. 1b, c), and the monomer can be subdivided into 
two domains, with a globular amino-terminal domain that contains few 
secondary structure elements. This domain holds eight covalently attached 
haem groups, one of which (haem 2) forms the active site (Fig. 1d, e). 
Haem 8 is attached to a non-canonical binding motif (CX,,;CH) within 
the peptide chain, as previously shown by mass spectrometry’. The 
carboxy-terminal domain consists of three long a-helices that form an 
extended multimerization interface, similar to that in the pentahaem 
NrfA**** and the octahaem proteins hydroxylamine oxidoreductase 
(HAO) from Nitrosomonas europaea™ and nitrite reductase from 
Thioalkalivibrio nitratireducens"’. In all these cases, the positioning of 
these helices and the resulting multimers are different, while the relative 
orientation of the haem groups is conserved (Fig. 1a, d and Extended 
Data Fig. 1). The haems of MccA line up at the inner periphery of the 
trimer, allowing access only to the active-site haem 2 situated almost in 
the centre of the domain, and to haem 8, the likely electron entry site 
(Fig. 1d, f). Among cytochromes of known structure, the haem groups 
align best with those of the monomeric octahaem tetrathionate reduc- 
tase (OTR) from Shewanella oneidensis'®, in which haem 2 forms the 
active site in both cases (Extended Data Fig. le). Fe-Fe distances range 
between 9.0 and 9.9 A for the parallel packing motifs and between 11.7 
and 12.4 A for the perpendicular motifs, allowing for high electron trans- 
fer rates. Between two monomers of the trimer, haem 1 and haem 4’ are 
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Figure 3 | Sulfite reduction at haem 2 of MccA. a, Sulfite binds to the resting 
state of the enzyme, but the active-site architecture shifts the reversible 
dehydration equilibrium to SO, + H,0. b, Reduction by two electrons occurs 
through photoreduction in the X-ray beam and leads to release of a second 
H,0 after protonation, with SO remaining bound to the active site. A tight 
network of hydrogen bonds surrounds the bimetal centre and the bound 
substrate, holding the reaction products in place. 


at the closest Fe-Fe distance of 17.0 A, with 9.3 A separating the por- 
phyrin edges (Fig. 1d). All haems are bis-histidinyl coordinated, with 
the exception of the penta-coordinate active-site haem 2 that has the 
distal axial position available for ligand binding. In the CX,;CH motif 
of haem 8, the extended region between the two cysteine residues forms 
a loop with a short helical turn, in direct vicinity to another loop 
harbouring the only non-proline cis peptide in the enzyme, between 
residues G508 and F509 (Extended Data Fig. 2). Its formation might 
require the essential peptidyl isomerase MccB’, and it is presumed to 
be a prerequisite for correct folding of the loop in the maturation pro- 
cess of haem 8, which is likely to be attached by the dedicated cyto- 
chrome c synthase CcsA1 (refs 17-19). The structure of the CX,;CH 
haem c binding motif disrupts the general parallel/perpendicular 
haem stacking sequence, and rotates the haem out of plane, possibly 
to optimize the interaction with the putative electron donor, the iron- 
sulfur protein MccC (Extended Data Fig. 1a)’. 

The active-site haem 2 is bound to a canonical CKXCH motif with 
H306 as a proximal axial ligand (Fig. 2a). An unprecedented feature of 
MccA is the presence of two cysteine residues, C399 and C495, juxta- 
posed at the distal side of the active-site cavity. In the initial structure of 
form II of the enzyme, an additional electron density maximum was ob- 
served between the cysteine thiolates, coordinated in an extended geo- 
metry at distances of 2.1 A and 2.2 A, respectively, with an S-X-S angle 
between 140° and 145°. This mode of coordination was suggestive of 
Cu(I), so that in a fully occupied form I crystal we used anomalous 
double-difference (AA,,.) electron density maps collected at the X-ray 
absorption edges of several third-row elements (Co, Ni, Cu, Zn, As) to 
confirm unambiguously the identity of the atom as copper (Extended 
Data Fig. 3). In form II MccA this site was only partially occupied, re- 
sulting in a weaker maximum for the electron density peak and a mix- 
ture of two conformations with partial formation of a disulfide bond 
between C399 and C495 (Extended Data Fig. 4). The oxidative decom- 
position of the haem-copper site is not reversible by the reduction or 
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Figure 4 | Mechanism of stepwise sulfite reduction by MccA. a-d, The 
schemes represent a view onto the plane of haem 2. a, Sulfite is bound at an 
oxidation state of S*'V and immediately dehydrated. All oxygen atoms are fixed 
in a tight hydrogen-bonding network (Fig. 3a). b, Upon transfer of two 
electrons to yield an S*™ state, a second oxygen atom is released as water, but 


addition of Cu(I) salts, indicating that its assembly requires further fac- 
tors. One of these may be an orthologue of the copper chaperone NosL, 
which is encoded by a gene that forms part of the mcc gene cluster’*. 
The geometry of the copper site suggests a preference for Cu(I), and 
metal determinations by inductively coupled plasma emission mass 
spectrometry (ICP-MS) indeed confirmed that forms I and II of MccA 
differ markedly in copper content. The high-copper form I, with an 
average Cu content of 0.7-1.0 per monomer, also showed higher cat- 
alytic activity (kcat = 200.6 s ') than form IJ, with an average of 0.2- 
0.5 Cu per monomer (Kat = 99.5 s |; Extended Data Fig. 5), in good 
agreement with the Cu occupancy observed in the crystal structures. 
The active centre of MccA thus consists of an unprecedented haem 
c-Cu(I) arrangement, with a metal-metal distance of 4.4 A (Fig. 2a). 
This is a slightly shorter distance than the 4.9 A observed in the haem 
a,—Cug site of cytochrome c oxidase (Fig. 2b), the only other haem- 
copper enzyme known, where the diatomic peroxide intermediate binds 
in a linear 1-1:n bridging mode”. 

MccA reduced sulfite, but not arsenate, selenate, selenite, hydroxyl- 
amine, hydrazine, fumarate, nitrate, thiosulfate, tetrathionate, polysul- 
fide or Fe(III)**. Nitrite was reduced only very slowly to ammonium, 
with a ka, of 16.1 s / for form I (Extended Data Fig. 5 and Extended 
Data Table 2). The unusual catalytic properties of MccA can be ratio- 
nalized from its active site, a shallow cavity on the distal side of haem 2, 
lined by residues K208, Y285, Y301, R366 and K393, which are con- 
served among MccA orthologues (Fig. 3a). In the copper-depleted form 
II of MccA, the absence of the heterometal allowed for a binding mode 
of sulfite that was similar to the one seen in the sirohaem-containing 
enzymes or in NrfA (Extended Data Fig. 4)*?’. In the structure of the 
Cu-containing, high-activity form I of MccA, however, all 12 mono- 
mers in the asymmetric unit had a ligand bound to haem 2. While in 
most cases this was a mixture of two species, the electron density maps 
for some monomers were unambiguous, revealing one of two distinct 
species. In chain D this was an SO, molecule, coordinated through sul- 
fur (Extended Data Fig. 6a). The two oxygen atoms of the ligand formed 
short hydrogen bonds to R366 and K208, respectively, with an addi- 
tional interaction of both with K393 from the roof of the cavity. Fur- 
thermore, a water molecule contacted both sulfite oxygens and the 
hydroxyl groups of Y285 and Y301 (Fig. 3a). In chain H the dominant 
species was an SO molecule, probably generated through photoreduc- 
tion during X-ray data collection or the exposure to H2 in an N2/H2 at- 
mosphere during oxygen-free crystallization. It coordinated the haem 
iron with its sulphur atom, which was readily distinguishable from the 
lighter oxygen (Extended Data Fig. 6b). The oxygen atom was hydro- 
gen bonded to R366, and two water molecules were defined. One was 
bound to Y285 and Y301, while the other formed hydrogen bonds with 
K208 and Y285 (Fig. 3b). Overall, the hydrogen-bond network in the 
active-site cavity remained almost unchanged between both states, while 
a water molecule was removed from SO3. No electron density was ob- 
served between the substrate molecule and copper, and the sulfur- 
copper distance was 3.2 A for SO, and 3.3 A for SO. 
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held within the active-site cavity (Fig. 3b). c, A further two-electron reduction 
leads to the S° state, weakening the remaining S-O bond. d, With the final 
transfer of two electrons, the state of sulfide (s_4) is reached and a third water is 
released. In all intermediates, protons are omitted for clarity. 


These snapshots of sulfite reduction at MccA suggest that the six- 
electron reduction to sulfide starts with the binding of sulfite to haem 2 
and an immediate dehydration to SO,*H,O (Figs 3a and 4a). The linear 
Cu(I)-S, arrangement supports this step with an overall negative charge, 
but the binding and dehydration does not yet involve reduction of the 
substrate and the SO, sulphur atom remains at the oxidation level S*'Y 
(Fig. 4a). The arrangement of haem groups in the protein then promotes 
three consecutive two-electron reduction steps, leading via S*" =O 
(Figs 3b and 4b), after elimination of a second water molecule, and 
S°-O (Fig. 4c), where the third HO is released, to sa (Fig. 4d). Here, 
residue K393 plays an important part, as it forms hydrogen bonds to 
both oxygen atoms of SO, and also to the O,, atom of Y285 (Fig. 3a). 
After reduction by two electrons, a water molecule moves towards Y285 
by 1.5 A, breaking its hydrogen bond to K393, but retaining the one to 
K208 (Figs 3b and 4b). The release of a third water molecule probably 
then severs the remaining hydrogen bond to K393 and possibly weak- 
ens the K393-Y285 interaction, leading to the deconstruction of the 
hydrogen-bond network in the active site and product release. The 
structures elucidate why MccA does not generate side products such as 
thiosulfate or trithionate through the closed hydrogen-bond network 
that holds all reaction partners tightly in place during catalysis and re- 
tains the oxygen atoms well defined as bound product water (Fig. 3b 
and Extended Data Fig. 6). The proposed mechanism does not require 
a change of oxidation state for the Cu(I) site, whose main task is to shift 
the dehydration equilibrium of the substrate towards SO}. This view is 
supported by the observation that oxidation by air led to an irreversible 
loss of copper (Extended Data Fig. 4). In the presence of eight strongly 
absorbing haem groups, a single, transient Cu(II) may be difficult to 
detect (Extended Data Fig. 7a), but under no conditions did time- 
resolved ultraviolet/visible spectroscopy provide any indication of the 
occurrence of oxidized copper (Extended Data Fig. 7). Also, the sulfite 
substrate only oxidized half the haem groups of reduced MccA (Ex- 
tended Data Fig. 8), emphasizing that the total electron charge of the 
multihaem enzyme is of high relevance for catalysis. Taken together, 
we have elucidated the structural and mechanistic basis of the most 
active bacterial sulfite reductase known to date, an enzyme that is as- 
sumed to contribute significantly to sulfite turnover in various habitats 
on Earth. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Protein purification and crystallization. MccA was produced from cells of mu- 
tant W. succinogenes MccA-2XStrep under fumarate-respiring conditions, as a 
variant with a C-terminal double Strep-tag II’”’. The cells were pelleted at 6,000g, 
resuspended in 50 mM Tris/HCl buffer, pH 8.0, and 150 mM NaCl, and disrupted 
by two passages through a French pressure cell at 138 MPa (SLM Aminco). The 
lysate was cleared by centrifugation at 100,000g for 60 min. The supernatant con- 
taining the target protein was loaded onto Streptactin affinity resin (IBA) and eluted 
ina single step with 2.5 mM of D-desthiobiotin. The eluted protein-containing frac- 
tions were further purified by size-exclusion chromatography on a HiLoad 26/60 
column with Superdex 200 prep grade material (GE Healthcare) that was pre- 
equilibrated and run in 20 mM Tris/HCl, pH 7.5, 150 mM NaCl. MccA eluted in 
two peaks, representing the trimeric and monomeric forms of the protein, respect- 
ively. For anoxic preparations, dioxygen was removed from all buffers by eight cycles 
of vacuum/N, using modified Schlenk techniques. Protein was isolated using closed 
systems, and further handling was carried out in an anoxic chamber at <5 p.p.m. 
pOs. For crystallization only the trimer fraction was used and concentrated to 
6mgml ' for MccA form I and to 10mgml? for form II by ultrafiltration in 
Vivaspin 20 concentrators (50 kDa molecular weight cut-off, Sartorius). 
Crystallization and X-ray diffraction data collection. Well-diffracting crystals 
of both forms I and II of MccA were obtained exclusively in an anoxic chamber at 
room temperature, using the sitting-drop vapour diffusion method. For MccA 
form II, 0.5 pl of protein solution (10 mg ml!) was mixed with 0.5 pl ofa reservoir 
solution containing 13-17% (w/v) of polyethylene glycol 3,350 and 100 mM CHES/ 
NaOH buffer at pH 8.5-9.0 and equilibrated against 100 1] of the same reservoir. 
Crystal growth required either the addition of the reductant sodium dithionite or 
co-crystallization with 200 mM of sodium sulfite. MccA form I crystallized from a 
solution of 6 mg ml’, with a reservoir solution of 10-15% (w/v) of polyethylene 
glycol 5,000 monomethyl ether, 100 mM HEPES/NaOH buffer at pH 7.0-7.5, 5% 
tacsimate, pH 7.0 (Hampton Research) and 4mM Zwittergent 3-12 (Anatrace) 
without addition of substrate or reaction intermediates. Diffraction data sets were 
collected on beamline X06SA at the Swiss Light Source (Villigen, Switzerland) using 
a Pilatus 6M detector (Dectris). Crystals of the copper-depleted MccA form II be- 
longed to the rhombohedral space group H3, with unit cell dimensions of a = b = 
186.5 Aandc = 232.8 A and four monomers per asymmetric unit, while MccA form I 
crystallized in monoclinic P2;, with a = 211.1 A, b= 103.0 A, c = 233.1 A, and 
f = 98.5° and had 12 monomers per asymmetric unit. For phase determination, a 
data set of a form II crystal was collected at the Fe K-edge at 2 = 1.739 A. Diffrac- 
tion data for the generation of anomalous double difference maps for identification 
of the additional metal site were collected at the Co K-edge at 1 = 1.608 A, the Ni 
K-edge at 2 = 1.488 A, the Cu K-edge at 2 = 1.381 A, the Zn K-edge at 2 = 1.283 A 
and the As-edge at 2 = 1.044 A. Diffraction data were indexed and integrated with 
XDS” and scaled with AIMLESS” from the CCP4 suite’. For data processing sta- 
tistics see Extended Data Table 1. 

Structure determination. Phase information for MccA form II was obtained by 
single-wavelength anomalous diffraction using data collected at the Fe K-edge. All 
32 Fe sites and several sulphur positions were located with SHELXD” and refined 
with SHARP”’. AutoSHARP was used for phase calculation, density modification 
and initial model building”. The structure was rebuilt and refined in iterative steps 


using COOT” and REFMAC5” to final Reryst/Riree Values of 0.16/0.18 at 2.1 A re- 
solution. This model was subsequently used to solve the structure of MccA form I 
by molecular replacement with MOLREP*". All structures were validated with 
MolProbity*. Refinement and validation statistics are summarized in Extended 
Data Table 1. 

Metal quantitation. The Cu content of MccA was determined by ICP-MS (Spu- 
renanalytisches Laboratorium Dr Baumann, Maxhiitte-Haidhof, Germany) to 0.7- 
1.0 Cu per monomer and 0.2-0.5 Cu per monomer for different preparations of 
forms I and II, respectively. Owing to the interfering presence of sodium ions the 
error of the analysis was large, but the electron density features observed in the 
corresponding crystal structures were in accordance with the obtained values. 
Activity assays. The specific activity of MccA for sulfite and nitrite reduction was 
determined at 303 K using benzyl viologen as an artificial electron donor. In the 
absence of dioxygen, benzyl viologen (0.4 mM) was reduced by oxalate (5 mM) via 
catalytic amounts of 5-deazaflavin upon illumination, and its reoxidation rate before 
and after addition of substrate was monitored at 562 nm. Final protein concentra- 
tions were 0.8 jig ml! for form I and 1.1 pg ml“! for form II. The assay was con- 
ducted in 100 mM potassium phosphate buffer at pH 7.0. Substrate concentrations 
were assayed from 5 {1M to 1 mM and kinetic parameters were determined from a 
sigmoidal fit (Extended Data Fig. 5). 
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Extended Data Figure 1 | Comparison of monomer structure, haem c, N. europaea hydroxylamine oxidoreductase (PDB accession 4FAS)*’. 
packing and quaternary structure in multihaem c enzymes. a-e, Monomer d, Hexameric T. nitratireducens nitrite reductase (PDB accession 3SXQ)””. 
structure is shown on the left, haem packing in the centre, and quaternary e, S. oneidensis tetrathionate reductase (PDB accession 1SP3). All haem 
structure on the right. a, W. succinogenes sulfite reductase MccA. arrangements are superimposed onto that of MccA (red). 


b, W. succinogenes nitrite reductase NrfA (PDB accession 1FS7)"°. 
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cis-peptide 


Extended Data Figure 2 | Uncommon structural features of MccA. Haem _ that contains the only non-proline cis-peptide bond of MccA between G508 
group 8 is linked to an unusual CX,;CH motif that results in an extendedloop and F509. Conceivably, the action of the peptidyl isomerase MccB is required to 
region between C562 and C578 (red), dislocating haem 8 from the regular, generate this conformation after insertion of Cu(I), since the deletion of the 
perpendicular haem-packing pattern otherwise observed in multihaem mccB gene prevented the formation of stable MccA’. 

enzymes. This region is adjacent to a loop starting at the copper ligand C495 
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Extended Data Figure 3 


Identification of the additional heteroatom as 
copper based on anomalous difference electron density maps. a, Diffraction 
data sets were collected at the absorption edges of Co (7,710 eV), Ni (8,333 eV), 
Cu (8,974 eV), Zn (9,664 eV) and As (11,866 eV, not shown in plot). 

b, Integrated electron density is shown for a sphere with r = 1.0 A around the 
Fe positions (grey) and the heterometal site (green) indicated in the individual 
data sets. The heterometal absorption properties behave as expected for Cu. 


Error bars represent standard deviations for data from 12 individual 
observations in the asymmetric unit of the crystal form. c, Anomalous 
difference (F* — F_) electron density maps calculated for the individual data 
sets at the elemental absorption edges and contoured at the 5c level. 

A maximum is observed only at energies corresponding to Cu and above, 
allowing for an unambiguous identification. 
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Extended Data Figure 4 | Structure of partially copper-depleted MccA. from the active site to form a disulfide bond. This leads to improved access to 
Protein isolated under oxic conditions commonly showed a decreased copper _ the active site at the distal axial position of the iron atom of haem 2, allowing 
content of 0.2-0.5 Cu per monomer. The electron density surrounding the for the binding of SO;?~. The short distance of 1.4 A between a sulfite 

active site was best interpreted as a mixture of copper-containing monomers, —_ oxygen and the Cu(I) ion indicated that the two indeed represent alternative 
probably with bound SO, as in Fig. 3a, and an oxidatively damaged, Cu- and mutually exclusive conformations, forms I and II, that overlay in 


depleted form, where the thiol groups of C399 and C495 have rotated away the crystal. 
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Extended Data Figure 5 | Representative steady-state kinetic analysis of 
MccA. a, The reactivity for sulfite and nitrite was assayed by following the 
reoxidation of benzyl viologen. MccA form I, isolated in the strict absence of 
dioxygen, showed higher activities for sulfite (black) and nitrite (green) than 
form II, isolated under oxic conditions (sulfite, blue; nitrite, yellow). For sulfite 
reduction, the kinetic parameters obtained were Vax = 151.3 +0.5 U mg 
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and Ky, = 43.9 + 2.3 uM for form I and Vmax = 71.8 + 0.5 U mg | and 

Ky = 63.9 + 9.4M for form II (see Extended Data Table 2 for parameters). 
b, pH dependence of the sulfite-reducing (black) and nitrite-reducing (green) 
activity of MccA. Sulfite reduction shows an apparent maximum at pH 6.2. 
All measurements were carried out in triplicate with the error bars representing 
standard deviations between individual experiments. 
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Extended Data Figure 6 | Difference electron density maps for bound 
substrate and intermediates. a, b, The stereo images show monomer D 
(a) and monomer H (b) of the P2, structure of form I MccA (Extended Data 
Table 1). Whereas in a an SO, molecule is bound to the Fe ion of haem group 2 
together with a single H.O molecule, photoreduction has led to the generation 


ofa second H,0, leaving SO bound to haem. The quality of the electron density 
maps allowed us to discern S and O atoms, but the state depicted in b may 
represent either an S(II) = O adduct (Fig. 4b) or the further reduced $(0)-O 
(Fig. 4c). All maps are F, — F, omit difference electron density maps 
calculated in the absence of ligands. 
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Extended Data Figure 7 | Copper detection by time-resolved electron 
excitation spectroscopy of W. succinogenes MccA. a, Owing to the presence 
of eight haem groups, a hypothetical, calculated absorption band of a typical 
type I copper site with an extinction coefficient of approximately 

5,000 M*cm~' at 600 nm (grey) would be difficult to detect in a background 
of oxidized (black) or reduced (purple) haems, but may possibly be extracted 
from the well-characterized absorption of the porpyrin cofactors. b, Form I 
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MccA as isolated shows all haem groups in the oxidized state, with copper 
present as Cu(I) (black trace). Reduction with 100-fold molar excess of Ti(IID 
citrate in a stopped-flow spectrophotometer rapidly leads to the emergence 
of a typical ferrohaem spectrum (purple). For oxidation of reduced protein 
see Extended Data Fig. 8. c, Kinetic traces at 552 nm (red) and 600 nm (blue) 
show that the rise of the «-band at 552 nm is not accompanied by a relevant 
change at 600 nm. 
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Extended Data Figure 8 | Reoxidation of W. succinogenes MccA with number of reduced haems can be deduced from the relative position of the 
sulfite. MccA as isolated in the oxidized state (black) and subsequently fully — «-band at 552 nm between the oxidized state (0 haems reduced, black) and the 
reduced by Ti(III) (purple) to be reactive towards sulfite. After removal of Ti(II)-reduced state (8 haems reduced, purple). This evaluation shows that 
excess reductant, the stepwise addition of sulfite to reduced MccA leads to sulfite only obtains up to 4 electrons from fully reduced MccA (inset right). 
successive oxidation of haem centres (inset left, grey spectra). However, evenat As the enzyme forms a trimer with 24 haem groups, a total of 12 electrons are 
a large excess of sulfite (250 mM) the oxidation of MccA is incomplete. drawn from the system, sufficient to fully reduce 2 sulfite molecules to sulfide. 


Assuming nearly identical spectroscopic properties for all haem groups, the 
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Extended Data Table 1 | Data collection and refinement statistics 


Data Set 


MccA form I (anoxic) 


MccA form II (oxic) 


PDB accession code 
space group 

a,b,c 
a, B, 


resolution limits [A] 


cell constants 


completeness (%) 
unique reflections 


multiplicity (%) 


* 


4RKM 
P2, 


[A] 211.1, 103.0, 233.1 
°] 90.0, 98.5, 90.0 


49.37 — 2.20 
(2.24 — 2.20) 


100 (99.9) 
500,807 (24,733) 
6.6 (6.4) 


4RKN 
£3 


186.5, 186.5, 232.8 
90.0, 90.0, 120.0 


47.22 —2.10 
(2.14 — 2.10) 


99.8 (96.6) 
175,765 (8,426) 
9.4 (5.8) 


Rmerge 0.199 (1.774) 0.102 (0.430) 
Rate” 0.084 (0.758) 0.035 (0.188) 
mean I/ o(1) 9.2 (1.2) 17.3 (3.6) 
CCip” 0.992 (0.506) 0.998 (0.843) 
refinement statistics 

Riga" 0.18 0.16 
Reree (Yo) 0.22 0.18 
non-hydrogen atoms 70,014 24,154 
solvent molecules 3,144 1,464 
Cruickshank’s DPI *° 0.246 0.205 
figure of merit 0.802 0.901 
r.m.s. deviations from ideal values 

bond lengths (A) 0.009 0.007 
bond angles (°) 1.3 1.3 
average B values (A’) 

protein main chain atoms 38.98 2551 
protein all atoms 39.98 26.03 
solvent 35.65 30.43 


References 34-36 are cited within the table. Riree is the cross-validation R value for a test set of 5% of unique reflections®’. 


* Rerge = Lnki [ 
+ Reryst = Zhe | 


= [4 - | ys, y 
Foos | ~ | Featc | |/Shat | Fobs | 
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Extended Data Table 2 | Kinetic parameters for MccA 


enzyme substrate Viens Km Keat Keat/Km 

form I (anoxic) SO; 151.340.5 43.942.3 200.6 3.44 
NO. 11.6+0.3 44.0+5.0 16.1 0.37 

form II (oxic) SO; 71.8+0.5 63.949.4 99.5 1.55 
NO. 5.1+40.2 39.9+4.9 Td 0.18 


Kinetic parameters were recorded at 303 K. Units for Vmax are pmol min~ +mg~?; Ky is in 1M; Keat is in s~+; and for keat/Ku units are »Ms~ 
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human ground state naive 


pluripotent stem cells 
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Tamar Shwartz, Shlomit Gilad, Daniela Amann-Zalcenstein, 
Sima Benjamin, Ido Amit, Amos Tanay, Rada Massarwa, 
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Nature 504, 282-286 (2013); doi:10.1038/naturel2745 


The reduced representation bisulphite sequencing (RRBS) data gen- 
erated and analysed in this Letter were not originally uploaded to the 
Gene Expression Omnibus (GEO), but can now be found under acces- 
sion number GSE52617 (within the GSE52824 super series). As can be 
seen in this RRBS data, mouse naive pluripotent cells expanded for an 
extended period (that is, 90 days) in naive 2i/LIF (inhibition of ERK1/2 
and GSK38 signalling plus leukaemia inhibitory factor)) conditions, 
globally lose DNA imprinting. 
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CORRECTIONS & AMENDMENTS 
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Corrigendum: Deterministic direct 
reprogramming of somatic cells to 


pluripotency 


Yoach Rais, Asaf Zviran, Shay Geula, Ohad Gafni, 

Elad Chomsky, Sergey Viukov, Abed AlFatah Mansour, 
Inbal Caspi, Vladislav Krupalnik, Mirie Zerbib, Itay Maza, 
Nofar Mor, Dror Baran, Leehee Weinberger, Diego A. Jaitin, 
David Lara-Astiaso, Ronnie Blecher-Gonen, Zohar Shipony, 
Zohar Mukamel, Tzachi Hagai, Shlomit Gilad, 

Daniela Amann-Zalcenstein, Amos Tanay, Ido Amit, 

Noa Novershtern & Jacob H. Hanna 


Nature 502, 65-70 (2013); doi:10.1038/nature12587 


In this Article, the reduced representation bisulphite sequencing (RRBS) 
data obtained and analysed were not initially uploaded to the Gene 
Expression Omnibus (GEO) record, but can now be found under acces- 
sion number GSE641 15, within the GSE49767 super series. Furthermore, 
gene expression data for two samples (Mbd3*/* embryonic stem cells 
and Mbd3*'* mouse embryonic fibroblasts (MEFs) after 8 days on doxy- 
cycline (dox)) were generated previously by our group’ and deposited 
in the GEO under accession numbers GSM874650 and GSM874654. 
Further analysis of this data set was included in Figs 3a, 5c and Extended 
Data Fig. 8a, c of this Article. Also, in Fig. 5c and Extended Data Fig 8c 
of this Article, the ‘Mbd3*’* MEF + 8 day dox’ sample was inadver- 
tently mislabelled as ‘Mbd3"’* MEF + 11 day dox’. This change does 
not influence the conclusions of the manuscript. 

In addition, further details on the experimental settings used to 
generate the data in the GSE49766 series reported in this Article (under 
the GSE49767 super series) have been added to the GEO website. It now 
states that in Fig. 3a and Extended Data Figs 5 and 8, out of 20 inde- 
pendent clonal series generated in our study carrying either the GOF18 
APE-Oct4-GFP transgenic reporter (Addgene plasmid 52382) or the com- 
plete GOF18 Oct4-GFP transgenic reporter (Addgene plasmid 60527), 
the clonal series selected for genomic analysis included an Mbd3*/* 
clone that carries the GOF18 APE-Oct4-GFP transgenic reporter, and 
Mbd3"*~ and Mbd3 ‘~ cells that carry the GOF18 Oct4-GFP trans- 
genic reporter (complete Oct4 enhancer region with distal and proximal 
enhancer elements). These reporters can be identified when analysing 
the Oct4 locus in genomic DNA input datasets. As we do not use Oct4— 
GFP or any other selection for sorting cells before conducting genomic 
experiments, the difference in transgene reporters would not influence 
the interpretation of our genomic analysis data in any way. In these 
genomics studies, the endogenous Nanog and Oct4 loci are not manip- 
ulated and are identical between all cell lines because the Oct4—-GFP 
reporters were introduced via random transgenesis and validated for 
specificity. Notably, for the mouse induced pluripotent stem (iPS) cell 
efficiency results presented in our Article (for example, in Figs 1 and 2 
and Supplementary Videos 1-4), cell lines carried the matched Nanog- 
GFP knock-in reporter or Oct4d—GFP transgenic reporter (containing 
both distal and proximal enhancer elements as delineated in Extended 
Data Fig. 3a). Thus, all iPS cell efficiency and kinetic comparisons were 
conducted by using matched and validated pluripotency reporter systems. 


1. Mansour, A.A. et al. The H3K27 demethylase Utx regulates somatic and germ cell 
epigenetic reprogramming. Nature 488, 409-413 (2012). 
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Global-health specialist Adrian Hill examines a candidate Ebola vaccine for a ground-breaking clinical trial. 


PUBLIC HEALTH 


Behind a vaccine 


A burgeoning vaccine industry paves paths for scientists to 
spread their knowledge outside the laboratory. 


BY BRYN NELSON 


Health Organization (WHO) called 

global-health specialist Adrian Hill, who is 
the director of a non-profit vaccine-research 
centre. They had an urgent question: how soon 
could the centre launch a clinical trial for an 
Ebola vaccine? 

“They weren't talking months — they were 
talking weeks, if not days,’ says Hill, who works 
at the Jenner Institute in Oxford, UK. A non- 
profit partnership between the University 
of Oxford and the animal-health-focused 
Pirbright Institute in Woking, UK, the Jenner 


[ August 2014, officials from the World 


Institute had vaccine-research programmes in 
progress that targeted nearly 20 human and 
veterinary diseases — but not Ebola. 

Yet within a month of the phone call, the 
institute had launched an Ebola-research 
initiative. And six months later, it was testing 
candidate vaccines, including one ina phase III 
trial that involved 27,000 people in Liberia. 

The rapid implementation of this 
programme underscores the swift pace 
and nimble nature of vaccinology, in sharp 
contrast to other disciplines in which it can 
take decades to gather results and create 
practical applications. And although priori- 
ties can shift rapidly, the pressing nature of 
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outbreaks means that researchers can see their 
ideas implemented quickly to tackle a major 
disease. “There are not many fields where 
you can go in as a graduate student to a lab 
and finish up four years later, and something 
you've made with your own hands at the bench 
is being used to immunize people in a clinical 
trial that you are part of} Hill says. 

Early-career scientists with an eye on 
vaccinology do not necessarily need a PhD. 
Because the field is geared towards translating 
research into practice (see “Vaccination cam- 
paigns must deliver trust first’), many employers 
value practical lab experience and a humanitar- 
ian mindset, and are willing to provide on-the- 
job training to promising candidates. 

Getting a vaccine from the lab to the clinic 
requires an approach that includes fields 
from microbiology to chemical engineering. 
“One of the attractions for a young person 
coming into this field is that it’s so diverse,” says 
Hill. “You can head off in different directions.” 
Opportunities exist throughout industry, the 
non-profit sector and academia. 


VACCINATION CONTAGION 

The global vaccine market has boomed in the 
past couple decades. Between 2000 and 2013, 
the market value for vaccines soared from 
US$5 billion to almost $24 billion; by 2025, that 
value is expected to quadruple. Groups around 
the world are trying to devise effective vaccines 
for dozens of diseases, especially the ‘big three’ 
— HIV/AIDS, malaria and tuberculosis. 

In 2000, the Bill & Melinda Gates Foundation 
in Seattle, Washington, began pouring money 
into the development and distribution of vac- 
cines for economically disadvantaged nations. 
After 11 September 2001, the US and other gov- 
ernments began to prioritize vaccine research 
for diseases that could be used in bioterrorism. 
And the WHO and other aid organizations have 
helped to drive home the necessity of control- 
ling diseases in developing countries, such as the 
ongoing Ebola outbreak in West Africa. 

Novavax, a vaccine research-and-develop- 
ment company in Gaithersburg, Maryland, has 
more than tripled in size over the past 4 years 
to roughly 300 employees. “Novavax is hiring 
like crazy,’ says Christi McDowell-Patterson, 
director of upstream process development 
for the company. Her department, a mix of 
chemical engineers, cell biologists and other 
scientists, manages the cell lines and equip- 
ment used to create the company’s vaccine 
candidates, including ones for Ebola, influenza 
and respiratory syncytial virus. 
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> Novavax also hires summer interns — 
mainly undergraduates or recent graduates 
with bachelor’s degrees. “If we like them, 
we'll try to figure out a way to keep them on,” 
McDowell-Patterson says. Novavax, along 
with other companies and organizations — 
such as Sanofi, Novartis and the International 
AIDS Vaccine Initiative — offers tuition 
reimbursement to support graduate education 
for promising young scientists. 

Apart from the many perks Novavax offers, 
ranging from yoga classes to a bowling league, 
chemist Natalie Thompson was most intrigued 
by the company’s production methods, which 
use virus-like particles and nanoparticles 
composed of recombinant proteins. She joined 
the company in March 2014 after completing 
a PhD in analytical chemistry and a three- 
year postdoc during which she used mass 
spectrometry to study monoclonal antibodies. 

At Novavax, she employs analytical methods 
such as liquid chromatography and mass 
spectrometry to determine the constituents 
and quality of the company’s vaccine products. 
“What I enjoy about working in industry is that 
the final goal is really defined,’ she says. She also 
likes the fast pace, and that many different teams 
work together. 


A SYMBIOTIC RELATIONSHIP 

Many vaccine-development organizations 
have set up shop near universities to take 
advantage of local academic expertise, 
which means that opportunities for budding 
vaccinologists are often steps away. Gerald 
Strohmaier, global head of industrial relations 
and corporate finance at the biotechnology 
company Valneva, regularly draws talent from 
the University of Vienna, which hosts the com- 
pany’s main research facility. “We do not havea 
huge scouting department looking for talent all 
over the world,” he says. “But whatever we can 
do with students, we do.’ Although Valneva 
hires only a few postdocs at a time, scientists 
who score a position have excellent job 
prospects — as many as 80% of them secure 
permanent jobs with the company. 

Whereas pharmaceutical and biotech 
companies focus on more-marketable vaccines, 
many non-profit institutes are researching 
vaccines that have less commercial value. Sala- 
ries at these institutes are generally lower than 
those for university or industry jobs, but people 
do have the potential to make a humanitarian 
difference in developing countries. 

The International Vaccine Institute (IVI), 
for example, is an independent organization at 
Seoul National University in South Korea that 
creates and introduces vaccines for neglected 
infectious diseases. One of the institute’s 
most-successful developments is a low-cost oral 
vaccine for cholera. But that vaccine requires 
two doses given two weeks apart, and IVI 
researchers are in the process of analysing 
data from a clinical trial of 200,000 people in 
Bangladesh to gauge the effectiveness of a 
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WORDS BEFORE NEEDLES 


Vaccination campaigns must deliver trust first 


Partnerships with health centres in developing 
countries help to immunize local children. 


Vaccine hesitancy — the reluctance to 
receive or engage in vaccination — is 
insidious and alive. In northern India, a 
polio-vaccination campaign had to dispel 
rumours about potential side effects. The 
rumours had been spread by parents who 
had objected to strangers vaccinating their 
children. In Kenya, religious leaders who had 
not been consulted before a tetanus-vaccine 
initiative revived a 20-year-old rumour that 
the vaccine would sterilize children. 

Heidi Larson has seen countless 
permutations of this issue. So accustomed 
was she to dealing with it during her work at 
the United Nations children’s charity UNICEF 
in New York City that she became known as 
the director of UNICEF’s fire department. 
The organization sent her around the 
world to extinguish the fires of distrust and 
suspicion that often impede the delivery and 
acceptance of desperately needed vaccines. 

An anthropologist by training, she had 
worked extensively on HIV and AIDS issues 
affecting adolescents in Nepal and the 
South Pacific. “I really had thought coming 
from AIDS into vaccines would be a bit 
bland,” she says. “To my surprise, this whole 
area of the introduction of new vaccines saw 
more emotional and political challenge than 
| ever could have imagined.” 

Director of the Vaccine Confidence 
Project at the London School of Hygiene 
and Tropical Medicine, Larson is mapping 
the social, cultural and political challenges 
to vaccination efforts around the world, from 
northern Nigeria to southern California. 

Skills in vaccine introduction, risk 
management and communication are often 
overlooked, but groups such as UNICEF 
and the World Health Organization are 
increasingly recognizing their value. As a 
result, career opportunities are emerging 
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for good communicators with a desire to 
combine science and public service, helping 
vaccination teams to earn the support 

of local communities. “I have more and 
more people contacting me who want to 

do masters and PhDs with our group and, 
at the same time, | have more and more 
demand for this type of work,” Larson says. 
Some organizations are so desperate for 
help that Larson is developing a training 
course that can be completed more quickly 
than a masters or PhD. 

Scientists can also do a graduate 
programme in public-health 
communication, and the US National Public 
Health Information Coalition and the Virtual 
Immunization Communication Network 
have compiled tools and training options. 

Larson is leading the communication 
and preparedness effort for an Ebola- 
vaccine trial in Africa that will involve an 
initial shot and follow-up boost. She and 
her colleagues are engaging community 
leaders to work out acceptable procedures, 
from how to seek informed consent from 
the local population to how to distribute and 
administer the vaccine. 

Even small misunderstandings can have 
profound consequences. The head of one 
vaccine programme related how ignorance 
of a local custom — addressing religious 
leaders and elders with honorifics to show 
respect — probably contributed to a year- 
long delay in getting approval for a mass 
typhoid-vaccination campaign in south Asia. 
When a colleague corrected the misstep, 
approval was gained in a few weeks. 

For a vaccine trial in Gambia, one of 
Larson’s graduate students modified the 
informed-consent process to align with the 
region’s oral tradition, replacing written 
documents with an interactive video in the 
local language. The approach enhanced 
comprehension and retention, especially for 
people who had low levels of literacy. 

“There’s a real science behind how you 
introduce a new vaccine. It just doesn’t 
happen by accident,” says Peter Jay Hotez, 
president of the Sabin Vaccine Institute and 
Texas Children’s Hospital Center for Vaccine 
Development in Houston. Beyond building 
partnerships, it also requires engaging in 
public dialogue. 

Despite the dearth of formal programmes 
focused on vaccine outreach and 
communication, Hotez and other researchers 
say that those who have the skills and 
diplomacy can be crucial assets for ensuring 
that a vaccine-development scheme does not 
fall short of its goal. BN. 
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one-dose version to improve compliance. 
The institute is always on the look- 
out for research talent, says Sushant 
Sahastrabuddhe, a physician who heads 
the institute's Enteric and Diarrheal Disease 
Programme. Its international scientific staff 
of about 50 includes researchers with PhDs, 
medical doctorates and master’s degrees. 

The formation of partnerships between 
non-profit organizations and developing 
countries is also creating job opportu- 
nities. The Infectious Disease Research 
Institute (IDRI) in Seattle, Washington, 
employs about 125 people and works with 
nearly 100 collaborators around the world. 
It has helped to establish vaccine-formu- 
lation centres in South Africa and India. 
Manufacturing vaccines locally instead of 
importing them can decrease both the cost 
and distrust of the product, says Steven 
Reed, IDRI’s founder. 


The need to bol- 
ster public-health The pressing 
infrastructure nature of 
in developing outbreaks 
regions has _ meansthat 
become increas- researchers can 
ingly important. see their ideas 
“There's minimal, implemented 
almost zero, quickly. 


capacity to develop 
vaccines in the 
Middle East, north Africa and sub-Saharan 
Africa,’ says Peter Jay Hotez, president of 
the non-profit Sabin Vaccine Institute and 
Texas Children’s Hospital Center for Vaccine 
Development in Houston. “And yet these are 
the places where we're going to see the next 
generation of catastrophic emerging infec- 
tions.” The US Department of State named 
Hotez asa US Science Envoy, a role designed 
to help promote international partnerships. 
For his project, he is pushing to expand 
vaccine infrastructure in Africa and the 
Middle East by focusing on countries such 
as Saudi Arabia and Morocco, which have an 
established scientific culture and PhD-level 
scientists. Ifhe succeeds, programmes that 
increase vaccine production in these places 
could lead to more jobs for local scientists, as 
well as for international consultants. 

For him, vaccinology provides the 
“perfect confluence” of humanitarian 
values and biomedical and social science. 
And as Hill can attest, the work is seldom 
boring. The past six months have been par- 
ticularly “invigorating and energizing’, he 
says, with regular enquiries about the Ebola 
trials from the British Prime Minister’s 
office and other top government officials. 
“Going to the highest level of government 
with what you assayed yesterday,’ he says, 
“is pretty exciting.” m 


Bryn Nelson is a freelance writer based in 
Seattle, Washington. 


WORKPLACE CLIMATE 


Metrics for ethics 


Focus on perceived working conditions could help graduate 
schools to train responsible researchers. 


BY MONYA BAKER 


raining in research ethics is mandatory 

for many US graduate students and 

postdocs, but there is little evidence that 
formal classes prompt scientists to conduct 
research ethically. However, the workplace 
climate — which includes perceptions of regu- 
latory committees, data confidentiality and 
treatment of trainees — influences research 
practices and can spawn behaviours such as 
poor record-keeping or plagiarism. 

An interdisciplinary team has developed a 
survey to assess work conditions in research 
institutions, with a long-term goal of establish- 
ing a baseline for measurements of workplace 
climate across disciplines and universities. The 
SOuRCe (Survey of Organizational Research 
Climate) is a 32-question survey that divides 
workplace climate into seven categories, 
including integrity norms (such as giving due 
credit to others’ ideas), integrity inhibitors (such 
as inadequate access to material resources) and 
adviser—-advisee relations. The team hopes that 
such data will help institutions to craft policies 
that will improve research conduct. 

The survey illuminates differences in 
attitudes held by faculty members and trainees, 
says Karen Klomparens, dean of the graduate 
school at Michigan State University (MSU) 
in East Lansing. When the school ran ver- 
sions of the survey in 2009 and 2014, clusters 


ETHICS IN THE ENVIRONMENT 

The level of self-reported fraud, fabrication and 
plagiarism increases as perceptions of ‘integrity 
norms’ fall and of ‘integrity inhibitors’ rise. 


{Integrity norms Integrity inhibitors 
(favourable) (unfavourable) 
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Perceived prevalence at workplace 
(1 = “not at all” and 5 = “completely”) 


*Predicted likelihood that an individual would report engaging in fraud, 
fabrication or plagiarism, according to regression analysis of a large survey 
on research behaviour and workplace climate during the past three years. 
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of respondents reported feeling ill-equipped 
to judge whether university policies support 
responsible research — which suggests that 
those topics are not discussed in meaningful 
ways, she says. Klomparens used the results to 
spur faculty members in specific departments to 
talk to trainees about norms in authorship, data 
management and peer review. “Because we 
use the survey data by graduate programme 
and by discipline, we can make recommenda- 
tions,” she says. To encourage participation, she 
emphasized to respondents that the tool is not 
intended to shame or punish, and responses are 
stripped of identifying information. 

Brian Martinson studies research integrity 
at the non-profit HealthPartners Institute for 
Education and Research in Bloomington, 
Minnesota, and helped to develop the survey 
at 40 academic health centres (B. C. Martinson 
et al. Sci. Eng. Ethics 19, 813-834; 2013). He 
has also worked on it in a separate project 
with MSU, Pennsylvania State University and 
the University of Wisconsin-Madison. A 
poor workplace climate correlates with many 
undesirable research behaviours, even extreme 
forms such as data falsification, he explains 
(see ‘Ethics in the environment’). Still, he 
thinks that institutions could boost integrity 
most effectively by focusing on common, less- 
attention-grabbing behaviours that are tightly 
tied to workplace climate, such as sloppy record- 
keeping. “They lower the standards that people 
are following over the long run,” he says. 

The dearth of robust, real-world research has 
long hampered efforts to improve integrity, says 
C. K. Gunsalus, director of the National Center 
for Professional and Research Ethics in Urbana, 
Illinois, who is working with Martinson’s team 
to distribute the study online. “The climate sur- 
vey provides actual empirical data,” she says. 

Gunsalus and others aim to aggregate 
results in a central database, so that a physics 
department at one institution, for instance, will 
be able to compare its climate scores with those 
of similar departments elsewhere. 

Analysing workplace climate could be a 
powerful way to promote integrity, says 
Nicholas Steneck, a consultant for the US Office 
of Research Integrity in Rockville, Maryland, 
although he fears that institutions might find 
it tough to apply SOuRCe results to policy. But 
Gunsalus thinks that broad, quantitative survey 
results could make the issue more difficult to 
neglect. “The best thing that gets traction with 
scholars and scientists,” she says, “is data.” m 
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THE ARCHIVE PERSONALITY PROTOCOL 


BY BRIAN TRENT 


¢C C ou can't possibly be drunk, Cynthia! 
Ye been out of cryosleep for 
less than two hours!” 

Cynthia Szabo reluctantly turned her gaze 
from the ship’s viewscreen to regard her 
inquisitor. “Did you say something?” 

Liza Neary’s lips curled into a snarl and 
she was about to retort when the viewscreen 
flared again. The colony fleet — 20 massive 
Stanford torus vessels launched from Earth 
300 years earlier — spread and contracted like 
a shoal of fish. Little explosions lit the void 
between them; a macabre light show from 
humanity's grand attempt at galactic diaspora. 

“You look drunk,’ Liza breathed, bits of 
cryofrost still dangling from her eyebrows 
and dreadlocks. “Great. Just great, Cynthia.” 

“Tm not drunk,’ Cynthia slurred. “I’m 
merely rattled, okay? Three centuries of 
cryosleep, and this is what I wake up to!” 

Liza laughed bitterly and squeezed the ends 
ofher dreads. “We knew that the descendants 
of the original colonists would inevitably 
form their own societies. Knew that theoreti- 
cally that might result in conflict, but...” 

“How bad is it?” 

“Want the laundry list?” 

Cynthia nodded reluctantly. 

“The Thebes went communist,’ Liza said, 
pointing to one of the many silver flecks on 
the screen. “They've sent a boarding party 
against the Sparta, which, while we were 
sleeping, became a loose confederation of 
libertarian enclaves. The fighting is a messy, 
level-by-level grudge match now.” 

“Why isn’t the Castor helping? They're the 
flagship of the entire fleet ...” 

“Sure, and as the flagship, the Castor was 
settled by Earth’s major entrepreneurial 
dynasties. Well take a look: Liza tapped the 
screen anda real-time video bubble opened, 
displaying a feed of the Castor’s agora deck. 
Sweaty, wearied labourers moved in endless 
lines, hauling materials through a colour- 
coded maze of aisles, while uniformed offic- 
ers monitored them from a cage-walk above. 

Liza explained: “In 300 years, the Castor’s 
plutocratic elite formed monopolies on air, 
water and food supplies, so now they leverage 
their ‘customer base’ to gain a fraction of stock 
advantage against their corporate enemies.’ 

Cynthia burped noisily, earning a scathing 
glare from her co-councillor. Three centuries! 
she thought. Seemed like only yesterday that 
she was climbing into her cryocapsule with 
the rest of the Emergency Protocol Council, 
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A great way to resolve a problem. 


Earth still visible through the porthole. She 
had expected to sleep the whole way, the 
cryosleep suspension gas sinking her into a 
grey and restful slumber until... 

She shook herself free of the memory. 
“Liza, youre not going to tell me the entire 
fleet has devolved into a free-for-all?” 

Liza rubbed her chin. “No, that wouldn't 
be accurate. We've got two ships that reverted 
to anarcho-primitivism, with genome clans 
decrying all technology and raising their 
kids to believe that their seedships are the 
entire world. Then there’s the Pollux...” 

She opened another video bubble. A mass 
wedding ceremony was taking place. But 
something was wrong with all the faces... 

“One eye is two inches higher than the 
other,” Cynthia gasped. “What the hell?” 

“Tve been calling them the cubists. Their 
ancestors apparently decided that asymme- 
try is the epitome of beauty, so they imple- 
mented a merciless breeding programme. 
The result, over the past three centuries, has 
been these charming little Picasso people. 
Anyone born with symmetrical features is 
ground into mulch? 

Cynthia felt sick. 

“The rest of the fleet collapsed into 
oligarchies, despotisms, not one but two 
feudal states that make sixteenth-century 
Japan look like Disneyworld, four theocra- 
cies, a sopranocracy —” 

“Should I even ask?” 

“Rule by those with the highest-pitch 
voices. Sounds like a ship-wide helium leak” 

Cynthia couldn't take it anymore. She 
pulled the hidden flask of vodka from her 

cryosuit pocket and 
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“Explains what? That humanity’s grand 
attempt at colonizing the stars has become 
this free-for-all?” 

“Thawed for two hours, and you still found 
time to get blitzed. Just great, Cynthia” 

“Hey, at least I did something! The moment 
I realized the fleet had degenerated into chaos, 
I enacted the Archive Personality Protocol!” 

Liza started to reply when suddenly two of 
the ships engaged in hostilities ceased firing. 
Another pair of ships followed suit. 

“As I was saying,’ Cynthia continued, “the 
protocol seemed like the only option. It’s —” 

“A storage of reconstructed personalities, 
to help us deal with any possible contin- 
gency when we reach our destination,” Liza 
snapped. “I know. Benjamin Franklin, Confu- 
cius, Saladin, Catherine, Marcus Aurelius...” 

“I decided to use it now, to restore order to 
the fleet.” She trailed off, as more and more 
ships began to converge into a tight, disci- 
plined formation. The colourful bursts of 
weapon fire were winking out. 

“Ha!” Cynthia exclaimed, the gleam of 
triumph in her inebriated eyes. “It worked! 
Good ol’ President Hamilton was able to talk 
sense into everyone!” 

Liza stared at her. “President Hamilton?” 

“T figured he was the most innocuous, 
effective choice. He helped to stitch the 
divided American colonies into a cohesive 
nation, so...” 

Liza checked the archive list of recon- 
structed personalities. “Alexander Hamil- 
ton?” she repeated. 

“Yes, goddam it.” 

“You were drunk,” Liza said, shaking 
her head. “You probably meant to conjure 
Mr Hamilton, but it looks like your finger 
slipped and selected the name above his.” 

Cynthia felt her smile melt. At that moment, 
a voice erupted from the ceiling speaker. It 
spoke in a language she didn’t recognize. 

Liza hit the TRANSLATE option, just in 
time to catch the tail-end of the message: 

“— have failed to lead yourselves, and now I 
shall lead you. We shall conquer the very stars, 
world by world, and none shall oppose our uni- 
fied might! Under my rule, we shall remake the 
very galaxy! I, Alexander, swear this to you!” 

“Alexander?” Cynthia muttered. “The —” 

“Great.” Liza snatched the flask from her 
hand and downed a mighty swig. “Just great, 
Cynthia.” = 


Brian Trent is a science-fiction writer 
and traveller. He lives in New England, 
somewhere between a city and a forest. 
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ver the past few decades, China has enjoyed economic 

prosperity, driven by its vigorous manufacturing 

output. More recently, the country has used this 
wealth to ratchet up other skills and now has the world’s 
second largest spend on research and development (see page 
S8), and the second largest output of scientific papers. 

Behind these accomplishments are the incentives that 
drive good scientific behaviour. Having lost a generation of 
scientists during the Cultural Revolution, the country has 
made a huge effort to instil qualities that are in line with those 
of other developed countries. These issues were examined at 
a symposium held in Shanghai in October 2014, co-hosted 
by Nature. Representatives from academia, industry and 
scientific societies talked about the importance of research to 
China’s ongoing development, and how to stimulate it (S10). 
This Outlook was influenced by the debates, although Nature 
takes sole responsibility for all editorial content. 

Unlike countries such as the United Kingdom and 
Australia, China does not have a national process of research 
assessment. The closest comparison is the evaluation 
undertaken by its largest research organization, the Chinese 
Academy of Sciences (S18). 

Any complete measure of science must also recognize new 
technology, improved health and job creation — but Chinese 
researchers face institutional obstacles in translating their 
research into industrial applications (S28). 

At the heart of this progress are China’s scientists, yet many 
of them are struggling to find their way in a system that is 
undergoing such extensive change. In particular, young 
researchers (S36) and female scientists (S26) crave policies 
that would enable them to make a greater contribution. Such 
improvements may fortify China's scientific acumen and 
make the country a formidable player in global research. 
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A CENTURY-OLD CHINESE UNIVERSITY 
ASPIRING FOR EXCELLENCE 


ounded in 1902, Nanjing University 

(NJU) is over a century old. It enjoys 

an excellent reputation in China and 
abroad. The historical development of the 
university has been closely connected with 
the mission of the nation and the needs 
of society. NJU's vision is to develop into a 
world-class, research-intensive and _inter- 
nationalized higher-education institution 
oriented toward high-calibre faculty, top- 
notch students and cutting-edge research. 


ts president is Jun Chen, a professor in 
geochemistry and an academician of the 
Chinese Academy of Sciences. Presently, 
NJU has over 33,205 full-time students, of 
whom about 14,188 are undergraduate stu- 
dents, 15,667 are master and doctoral stu- 
dents and 3,350 are international students. 
NJU has three beautiful campuses: Gulou, 
Xianlin and Pukou. It has 28 schools con- 
taining 73 departments and has 8 national 


key disciplines in the primary disciplinary 
categories, 13 in secondary categories and 
6 in the nurturing list. The university is listed 
in the first group of high-level universities 
in the national 211 and 985 projects, which 
attract strong funding from the Chinese 
government. As a member of the China 
Nine League, NJU boasts strong faculty in 


the humanities, social sciences, natural sci- 
ences, medicine and engineering. 


Faculty 
In its over 100 years’ history, NJU has culti- 
vated a great number of prominent and 


learned figures, and has consequently 
greatly contributed to the nation’s revital- 


ization and development. Many famous 


Advertiser retains sole responsibility for content 


scientists and scholars have studied or 
worked at the university. Among its 2,25 
faculty members, 28 are academicians of 
the Chinese Academy of Sciences, 3 are 
academicians of the Chinese Academy of 
Engineering, 1 is a foreign academician of 
the Chinese Academy of Sciences, 4 are 
academicians of the World Academy of 
Sciences, 
Academy of Sciences and 1 is an academi- 
cian of the Academy of Science of the Roya 
Society of Canada. In addition, there are 115 
Chair Professors and Distinguished Visiting 


is an academician of the Russian 


Professors of the Chang Jiang Scholars 
Program, 97 winners of the National 
Distinguished Young Scientists Award, 
29 overseas scholars under the National 
Thousand Talents Plan and 45 awardees of 
he National Youth Thousand Talents Plan. 


Research 

JU is a leading comprehensive univer- 
sity in China as assessed by various national 
and_ international rankings. According 
o Thomson Reuters’ Essential Science 


ndicators, 13 academic disciplines at NJU 
are ranked in the top 1 per cent worldwide 
and are ranked top among Chinese univer- 
sities. In the QS Word University Rankings 
2014, NJU has 20 disciplines listed in the 
global top 200. 
Over the past decade, NJU has acquired 
more than 800 national, provincial and min- 
sterial awards for research. Since 2000, NJU 
has won 1 first prize and 18 second prizes 
in the National Natural Science Awards, 
6 second prizes in the National Scientific 


and Technological Progress Awards and 4 


NANJING UNIVERSITY 


second prizes in the State Technological 
Awards. In 2006, a project accomplished by 
JU physicists led by academician Nai-ben 
ing won first prize in the National Natural 
Science Awards, which is the highest award 
for natural scientific research in China. 

n recent years, NJU has undertaken 
dozens of state key projects. In addition, 
JU ranks top among Chinese universities 


in terms of the quality and quantity of re- 
search papers listed in the Science Citation 
ndex (SCI). The number of SCI publications 
of NJU was ranked first among Chinese uni- 


versities for seven years since 1992, and NJU 
had the highest number of citations of SCI 
papers for eight years since 1992. In 2013, 
JU was ranked fourth among Chinese 
universities in the Nature Publishing Index. 
n 2014, NJU was ranked 38th in the Global 
ature Index and fourth for China. In 2015, 
U was ranked 7th in the Asia-Pacific 
ature Index, and again fourth for China. 


= 


n addition to conducting basic scien- 


ific research, NJU actively pursues applied 
scientific and engineering research to pro- 
mote economic and social development. 
JU has established 23 platforms that 
involve cooperation among government, 
industry and university, to boost industry— 


academia interaction and the local econ- 
omy. These platforms are for areas such as 


new materials, electronic data, biopharma- 
ceuticals, environment and resources and 
fine chemicals. 


International cooperation 
NJU is very active in international coop- 
eration and academic exchanges. It has 
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established partnership with over 280 
institutions of higher learning and research 
institutes from over 30 different countries. 


JU continually pursues opportunities for 
cooperation. Such efforts 
resulted in the establishment of the Center 


international 


for Chinese and American Studies with 


Johns Hopkins University, the Sino—German 
nstitute for Law Studies with the University 
of Géttingen and the Center for Chinese 
and Japanese Studies with the University 
of Tokyo. Many world-renowned scholars, 
including Nobel laureates, are honorary or 
visiting professors of NJU. In 2011, Aaron 
Ciechanover, who was awarded the Nobel 
Prize for chemistry in 2004, accepted an in- 


vitation to serve as Director for the Institute 
of Chemistry and Biomedical Sciences 
of NJU. In 2013, Jean-Marie Gustave Le 
Clézio, who was awarded the Nobel Prize 
for literature in 2008, began to work at NJU 
as Distinguished Resident Scholar of the 


Institute of Advanced Studies in Humanities 


and Social Sciences of NJU. 


Research highlights 
Physical sciences 

A briefing in Nature (389, 113-117, 1997) 
assessed the Laboratory of Solid State 
icrostructures of NJU as “approaching 
world-class standards in research” Of the 
numerous achievements at NJU, perhaps 
he most influential was that by Nai-ben 
ing on the dielectric superlattice (DSL). 
For this work, he and his team members 


were awarded first prize in the Nationa 
atural Science Awards in 2006. 

The team investigated the fundamen- 
al theory of domain engineering and 
quasi-phase-matching and applied it in 


he design, fabrication and application 


of DSL materials. In recent years, Shi-ning 
Zhu's group successfully developed and 
commercialized a series of DSL based 
solid-state lasers, including mid-infrared 
unable lasers, which have important ap- 
plications in non-invasive medical diag- 
nostics, free-space communication, laser 
scalpels, spectroscopy and remote sens- 
ing. Zhu's team has also done cutting- 
edge work in the fields of plasmonics and 
integrated photonic circuits. In particular, 
Hui Liu and co-workers demonstrated 


mimicking of gravitational lensing and 

optical trapping in an optical chip. 
Excited by the striking properties and 

applications of DSLs in nonlinear optics, 


Zhu and Xu focused on the generation and 
manipulation of photonic entanglement 
— a key phenomenon in quantum tech- 
nologies. The unique properties of DSLs 
enable the photon flux and achievable 
wavelength to be increased and also offer 
a compact way to tailor the polarization, 
frequency and spatial entanglement. 

The team fabricated the first lithium 
niobate (LN) quantum chip that had ef- 
ficient two-photon sources and manipu- 
ated photons using components such 


as junctions and wavelength-selective 
filters. This chip is controlled by on-chip 
electro-optical modulators, resulting in a 
variety of quantum states. “An LN super- 
attice may become an ideal platform for 
g 
S 


” 


fully integrated quantum optics, 
Ping Xu. 


realizin 


explain 


Chemistry 

Nanomaterials and nanostructures are im- 
portant in many fields. As one of the earliest 
nanoscience groups in China, Hong-Yuan 
Chen and Jing-Juan Xu's group in the State 


Key Laboratory of Analytical Chemistry for 
Life Sciences has introduced nanotech- 
nology and biotechnology to the field of 
electrochemistry and has conducted sub- 
stantial experimental and theoretical stud- 
ies in ultramicroelectrodes, electrochemical 
biosensing, bioelectrochemistry as well 
as microfluidic chips toward various life- 
related substances. Their work about the 
basic study of microelectrodes has been 
included in the official “Technical Report” 
by IUPAC in 2000. Interested in a broad 
range of functionalized nanomaterials and 
patterned nanostructures, they have ex- 
ploited the unique electronic, mechanical, 
electrochemical and interfacial properties 


of these materials to address critical issues 
in both fundamental studies and real-world 
applications toward various  life-related 
substances for more than 20 years. Their ac- 
complishments have been recognized by 
national-level scientific awards. 

Their current research focuses on the 
in-depth study of the electrochemical and 
photophysical properties of nanomateri- 
als for the development of innovative 
bioanalytical techniques based on their 
photoelectric interconversion processes, 


ie, electrochemiluminescent (ECL) and 
photoelectrochemical (PEC) bioanalysis. 
They were the first to combine the energy 
transfer with the ECL of semiconductor 
nanocrystals system and proposed the 
dual-potential ECL sensing 
approach as well as the wireless bipolar 
electrode in bioanalysis. By means of the 
energy transfer processes between CdS 
nanocrystals and noble metal nanopar- 
ticles in PEC sensing systems, they further 
realized the bioanalysis with high sensitivity 


ratiometric 


and specificity. 
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Earth sciences 
Atmospheric dust plays an important role 
in the marine and terrestrial geochemical 
cycles and impacts the global climate on 
daily and orbital (10*-10° years) timescales. 
Asia is the second largest source of dust on 
Earth; the deserts and sand fields in China 
dominate Asian dust emission. Studies o 
dust transport based on dust tracers and 
satellite imagery clearly show that aeolian 
dust from Asia is transported globally. In the 
leeward regions of dust source areas, dus 
deposition provides an excellent archive 
for reconstructing the past environmenta 
changes since at least 22 million years ago. 
Jun Chen and his team have investigated 
the origin, transportation and deposition 
of Asian dust; they used the loess and Red 


Clay sediments in North China and the 
aeolian deep-sea sediments in the North 
Pacific Ocean to reconstruct past environ- 
ment changes during the late Cenozoic era. 

They have focused on three major areas. 
First, they established the spatial patterns 
of Nd-Sr isotope ratios, rare Earth element 
compositions and detrital mineral com- 
positions for the aeolian sediments in the 
deserts, sand fields, Gobi deserts, loess 
deposition areas and the Tibetan Plateau. 
Based on these results, they were able to 


delineate the source regions of Asian dust 
and found direct evidence for the close 
relationship between the isotopic compo- 
sition of desert deposits and the tectonic 
settings of the surrounding mountains. 

Second, by analysing Nd-Sr isotopes and 
the geochemistry of detrital monomin- 
eral, they found that the direction of short- 
range transportation of dust in central and 
eastern Asia is largely determined by the 
prevailing near-surface wind. Source trac- 
ing revealed that the Asian dust has two 


material sources: the northern margin of 
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the Tibetan Plateau and the Central Asian 
Orogen. This confirms the importance of 
mountain processes in the production of 
silt aeolian particles. 

Third, they found that all the sand fields in 
northern China, an important dust source 
today, were nearly completely covered by 
vegetation during the Holocene Optimum, 
whereas the deserts in northwestern and 


central northern China were about 5-20 
per cent smaller in area at this time. This 
suggests that the surface conditions of 
Asian dust source regions changed dra- 
matically during the glacial and interglacial 
climate variations, and such changes affect- 
ed regional dust emissions and subsequent 
feedback to the climate change. Thus, the 
dust impact should be coupled with the 
e 


climate system in palaeoclimatic mod 
ling in order to fully understand the forcing 
mechanism of the past climate changes. 

The unique processes of dust deposition 
and transportation in central and eastern 
Asia provide a key to understanding the 
evolution of the Earth’s environmenta 


systems during the late Cenozoic. The 
ambitious research project of Chen and his 
team focuses on investigating the globa 
dust emission, transportation, deposition 
and their climatic impact at daily to orbita 


timescales. They are using high-resolution 
and high-precision geochemistry and 
mineralogy techniques to pinpoint dus 
sources and look for new biogeochemistry 
proxy indices to better reconstruct pas 
climate and environment changes. Their 
research will significantly improve our un- 
derstanding of the behaviour of the Earth's 
surface environmental systems. 


Information sciences 
LAMDA, or‘Learning and Mining from Data’ 
is a research team led by Zhi-Hua Zhou, 


who is an ACM Distinguished Scientist 
and IEEE Fellow. He is affiliated with the 
ational Key Laboratory for Novel Software 
Technology of NJU; this laboratory has 
opped the rankings of the national key 


aboratories in computer science in China 
in the past two five-year terms. Zhou's team 


focuses on research of machine learning 
and data mining, with the aim of construct- 


ing computer programs that can analyse 
existing data and generalize to future cases. 

Zhou and his collaborators developed 
many innovative techniques that have 
been applied to various practical tasks in- 
cluding image retrieval, movie annotation, 
network monitoring, CPU design, medical 
diagnosis, gene pattern annotation and 
mobile photograph management. Their 


innovations have also been incorporated 
into scientific software such as Matlab and 
many open-source software systems. Part 
of their research was awarded second prize 
in the National Natural Science Awards of 
China. In that year Zhou also received the 
Annual IEEE CIS Outstanding Early Career 
Award, which is awarded to one outstand- 


ing researcher under 40 years old. 

When Zhou founded LAMDA ten years 
ago, there were only two faculty members; 
now the team has grown to 11 faculty 
and research staff. LAMDA seeks to attract 
overseas researchers as well as cultivate lo- 
cal talent. Zhou obtained all his degrees at 
NJU and is an excellent example of home- 
grown talent. 
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THE UNIVERSITY OF HONG KONG: 
INNOVATION, INTERNATIONALIZATION AND IMPACT 


he University of Hong Kong (HKU) 

is a comprehensive, research- 

focused institution, ranked as 
one of the top universities in Asia and the 
world. HKU scholars have made their mark 
locally, regionally and globally in a wide 
range of fields. The university supports their 
vital, cutting-edge research with an English- 
medium environment that promotes excel- 
ence and innovation and a commitment to 
making a positive impact on society. 

“AKU has fantastic research opportuni- 
ies, but we are not satisfied with leaving 
it at that’ says HKU president, Professor 
Peter Mathieson. “We need to take the 
ead on impact because it will make our 
research better and because we have a 
moral responsibility to use our knowledge 
o inform public debate” 


A firm foundation for success 

HKU attracts some of the best scholars 
and students from around the world, with 
nearly two-thirds of its professors and post- 
graduate students coming from outside 
Hong Kong. Researchers at the university 
have the highest citation rate per academic 
in Hong Kong. Moreover, in 2014, Thomson 
Reuters ranked 115 HKU scholars — one 
in ten scholars at HKU — among the 
op 1 per cent in their fields globally, and 
named 5 HKU scholars among the most 
influential in their fields. In addition, 15 HKU 
researchers have been conferred member- 
ship to the Chinese Academy of Sciences 
and the Chinese Academy of Engineering, 


which are among China's most prestigious 
academic bodies. 

HKU scholars have conducted ground- 
breaking research in fields such as influ- 
enza and infectious diseases, geology 
and geochemistry, economic policy and 
international finance, drug discovery, 
energy transmission, the modern history 
of China, and suicide prevention. In 2014, 
HKU won the most funding from compet- 
itive general research fund grants among 
universities in Hong Kong for the 13th year 
in a row. HKU scholars also coordinate 
he highest number of publicly funded, 
cross-institutional projects and oversee 
five Partner State Key Laboratories, which 
are key components of China's system for 
science and technology research. 


Sustaining excellence 
With so much raw talent, HKU is keen to 
encourage interdisciplinary collaboration. 
Its strategic research themes initiative 
provides seed funding in five key interdis- 
ciplinary areas: biomedicine, frontier tech- 
nology, the environment, China and the 
community. ‘Blue-sky research’ also remains 
central to HKU's strategy and receives 
substantial support. 
Regional and international collaboration 
is another key goal at HKU. The univer- 
sity collaborates with international centres 
such as the Institut Pasteur in France and 
the Karolinska Institutet in Sweden, runs 
joint PhD programmes with King’s College 
and University College in London as well 
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as the University of Toronto, and has two 
major research centres in Zhejiang and 
Shenzhen in mainland China. These re- 
search centres capitalize on HKU's critical 
location at the gateway to China and at- 
tract considerable funding. 

The university makes a special effort to 
ensure that the integrity of its output is 
consistent with the highest international 
standards. HKU is a leader in Asia in this 
regard. It provides a compulsory research 
ethics course for incoming research stu- 
dents and mandatory seminars for staff, 
and has established a dedicated research 
integrity fund. 

HKU's energetic pursuit of excellence 
in research has established its standing 
among the best universities in the world. As 
the university moves forward, it will focus 
on enhancing and sustaining that success, 
as well as broadening its vision. 

Three words sum up the future for re- 
search at HKU: innovation, internationaliza- 
tion and impact. While much groundwork 
has been laid for the first two goals, impact 
is a relatively new target. Research with 
impact is expected to lead to greater fund- 
ing opportunities, new collaborations and, 
most importantly, contributions that will 
advance society. 
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ity University of Hong Kong has es- 
tablished itself as one of the mos 
innovative universities in Asia, 
pioneering research in fields as diverse as 
creative media, energy, environment and 
veterinary medicine, and pursuing wide- 
ranging projects, from healthcare manage- 
ment to robotics, nanotechnology and 
superhydrophobic materials. 

A relatively young institution — i 
celebrated its 30th anniversary in 
2014 — City University of Hong Kong 


Visitors to an installation at City 
University of Hong Kong can experience 
the magnificent Buddhist wall paintings 
inside China’s famous Magao Caves in 
virtual reality. 


Advertiser retains sole responsibility for content 


has risen with remarkable speed in 
several global rankings. The QS World 
University Rankings 2014-2015 ranked 
he University 108th in the world, 11th 


in Asia and 5th in the world for universi 


ies under 50 years of age. According to 
data compiled by Shanghai Jiao Tong 
University for the Academic Ranking of 
World Universities 2014, City University 
of Hong Kong ranked first in Hong Kong 
for engineering, technology and com- 
puter sciences, as well as for the more 


specific subject area of mathematics. 
Globally, the University listed 24th for 
he broad fields of engineering, technol- 
ogy and computer sciences and 28th for 
mathematics. The University’s College of 
Business came out second in the Asia- 


Pacific region in the University of Texas at 
Dallas Top 100 Business School Research 
Rankings 2010-2014. 

The University’s global success has 
been, in part, a result of its commitment 
o the four ‘is of integration, innovation, 
interdisciplinarity and initiative. 


City University of Hong Kong believes that 
students benefit by learning directly from 
academics actively involved in research, as 
well as from teaching staff who immerse 
themselves in research. This philosophy 
drives the University’s pursuit of the inte- 
gration of teaching and research. 


City University of Hong Kong's dedication 
to innovation in teaching and research is 


typified by a recent study tour to one of 
the world’s most dramatic landscapes. 
Students and faculty at the University 
transformed data collected during a 
series of environmental experiments in 
Antarctica into creative media artwork, 
which was exhibited in Hong Kong. The 
result was a perfect merging of art and 
science in one of the most innovative 
expressions of fusion research supported 
by an institution of higher education. 


City University of Hong Kong also believes 
hat the traditional boundaries between 
disciplines are no longer adequate for 
advancing research, and that academic 
institutions need to facilitate further 
overlap between curriculums of different 
disciplines. By bringing together experts 
from across its campus, the University 
has been able to explore the many fac- 
ets of interdisciplinarity. One example of 


his is a recent healthcare managemen 
project, which involved experts from the 
College of Business and the Departmen 
of Systems Engineering and Engineering 
Management. The project so impressed 
the government of Hong Kong that i 
was awarded a HK$20.45 million gran 
from the Research Grants Council. 


The University has pioneered research 
into the intersection of art and technol- 
ogy with the creation of the School of 
Creative Media in 1998, and in 2009 


it set up Hong Kong's first School of 
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Energy and Environment. 
his groundbreaking spirit of initiative, 
in 2014 the University founded Hong 
Kong's first-ever School of Veterinary 


Extending 


edicine, in collaboration with the 
College of Veterinary Medicine of Cornell 
University in the United States. The 
School of Veterinary Medicine seeks to 
become a global in veterinary 
education and research, and to establish 
itself as a centre of excellence for One 


Health, an approach that recognizes the 


eader 


close links between 
environmental heal 


human, animal and 


th, ultimately to con- 


tribute to improvem 


ents in the quality of 


life in the region. 


Government support 

Support from the government of Hong 
Kong and authorities in mainland China 
is essential for financially sustaining the 
University and to validate its work. In 
the Research Assessment Exercise 2014 
conducted by the University Grants 
Committee of Hong Kong, City University 
of Hong Kong was ranked top among 
institutions in Hong Kong in terms of 
the percentage of highly rated research 


in the areas of building technology, 
chemical engineering, civil engineering, 
communications and media studies, 
and mathematics 


materials science, 


and statistics. 


Highly cited researchers 

Similarly, the talents of individual re- 
searchers affiliated with the University 
have been endorsed and recognized 
internationally. The University ranked 
second in Hong Kong in terms of the 
number of researchers named in the list 


of Highly Cited Researchers 2014, pub- 
lished by Thomson Reuters. These highly 
cited scholars work in the fields of chem- 
istry, engineering, physics, mathematics 
and materials science. 


Snapshot: Recent achievements 

* New robotics technology for 
treating diseases 
An interdisciplinary research team at City 
University of Hong Kong is collaborat- 
ing with the University of Hong Kong to 
develop an innovative cell manipulation 
and testing system by integrating the 
technologies of robotics, optical tweezers 
and microfluidics. 


Art and technology: Virtual reality 
applied to ancient Buddhist art 

City University of Hong Kong is pio- 
neering new methods for exhibiting 
archaeological sites, such as a stand- 
out installation that allows visitors to 
experience in virtual reality the mag- 
nificent Buddhist wall paintings inside 
the famous Mogao Caves at Dunhuang 
in western China. 


Food waste into bioenergy 

An innovative process developed at 
the University uses a mixture of bac- 
teria to generate bioenergy from food 
waste, heralding possibilities for the 
development of a viable source of re- 
newable energy. 


Breakthrough in nanobiology 

A young faculty member in the 
Department of Biology and Chemistry 
contributed to the development of a 
novel nanothermometer for probing the 


health of living cells. The research, con- 
ducted in collaboration with researchers 
at Harvard University, was published in 
Nature in 2013. 


Superhydrophobic material 

A team of researchers at City University 
of Hong Kong has developed the world’s 
most water-repellent surface, which 
can be used for fabricating anti-icing 
materials, as well as for self-cleaning, anti- 
bacterial and anti-corrosion applications. 
The research was published in Nature 
Physics in 2014. 


Durable infrastructure 

Joint research between City University 
of Hong Kong and Massachusetts 
Institute of Technology in the United 
States has identified why adhesives 
holding certain materials together frac- 
ture and fail and what could be done 
to strengthen these bonds, indicating 
enormous implications for a range of 
bonded materials. 
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CHINA'S RESEARCH & DEVELOPMENT SPEND 


China’s investment in research and development (R&D) is second only to the United States. 
Here, we analyse the data to give a snapshot of how and where the money is spent. By Xiaole Ni. 


THE BROAD PERSPECTIVE Data are from 2013, 


unless indicated. 
In 2013, China’s R&D spending increased by 15% on 2012 — second only to the United States. Around 75% of spending is by industry, 


which also contributes the majority of the funds. Looking at the breakdown, the largest category of research is technology development. 2013 conversion rate 
All figures are in billions of yuan. 1 yuan = US$0.16 
, 250.1 | 
CONTRIBUTORS TO R&D Government 50.8 Others 
883.8 Industry 
Ww 
178.1 Public 85.7 Higher- 
WHO SPENT IT? research institutes education institutes 
13.3 
907.6 Industry Others 
+ 
126.9 Applied research 55.5 Basic 
SPENDING BREAKDOWN PRlnGee te len 
1002.3 Technology development 


Investment hotspots 


There are 31 areas under central government control that together spend mainland China’s R&D budget. 
For some, this represents a more sizeable proportion of their local gross domestic product (GDP) than for others?. 
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HIGHER-EDUCATION INSTITUTIONS 


In 2012, 1,025 universities received a total of 76.9 billion yuan in R&D funding from the government, which 
they supplemented by raising additional money from other sources. Universities may be under the direct 
control of government ministries (for example agriculture, education or information) or of local government. 
Complicating the picture, some reports wrap R&D funding into the larger category of science and technology 
(S&T) funding, which totals 117 billion yuan and includes operational costs’. All figures are in billions of yuan. 


29 universities 
receive nearly 
half of all S&T 
funding. 


TOTAL R&D 
FUNDING 


FUNDS 


RECEIVED RECIPIENTS 


TOP EARNERS 


FUNDS SPENT Tsinghua 


Zhejiang 


333 junior 


Technology colleges) 


development 


Harbin Institute 
of Technology 


: 2 Shanghai Jiaotong 

Project g 
. Pekin, 

f ‘IN Applied > esi = : 

16 g = Fudan 
1 fo} 

FS Beihang 

‘ 2 Huazhong University of 

19.7 Basic Rest (581 Science & Technology 

universities, Tongji 


ie) 1 2 3 
S&T funding received (billion yuan) 


The figure of 76.9 billion yuan 
is from a different year to the 
85.7 billion yuan in ‘The broad 
perspective’. 


China’s top 111 universities are part of Project 211, a government initiative to raise research 
standards in its higher-education institutes. Project 211 includes a subset of 39 universities in 
the elite Project 985. Project 211 establishments receive the most funding; other universities 
receive less money and need to raise a greater proportion of funds by themselves. 


176.5 


Given to agencies 
| by central government | 


107.1 Used by other government agencies 69.4 Used by the top three funders 


Around 70 government agencies fund R&D. Such agencies include the 


GOVERNMENT FUNDING 


There are three main agencies, detailed 
in shaded pink bars to the centre right, 
that account for nearly one-third of 
central government R&D spend. 

All figures are in billions of yuan. 


Money spent by the three main agencies also 


176.5 


Ministry of Education, the Ministry of Industry and Information 
Technology, the Ministry of Health, the National Development and 
Reform Commission, and the Ministry of Defence. 


includes additional non-governmental income. 
Amounts shown for the NSFC represent 
approved grant funding. 


Central 
government 
spend 


MONEY SPENT ON RESEARCH 


Applied Basic R&D si Other 
250 | Applied Technology development Basic Other 
Total central and 
per government Talent 
pend on R&D Research programmes training Research support 1.9 


eS BL 


i Ministry of Science 


Chinese Academy National Natural Science 


of Sciences (CAS) & Technology (MOST) Foundation of China (NSFC) 
Hee 
government 
spend FUNDING STRUCTURE IN 2015 FUNDING STRUCTURE FROM 2017 


A complicated picture 


There are four types 
of mechanism in 
place to allocate 
funding. The top two 
types involve around 
100 competitive grant 
schemes that are 
overseen by 30 
different government 
departments‘. 


—_—+ 


Mission-oriented contract research 
Including defence-related spending 


Operational costs 
Including for the state key laboratories = 


China’s government announced that by 2017, the two competitive grant categories will be 
modernized and managed under a new science and technology agency into five channels. 


Sources: 1. National Bureau of Science and Technology of the Ministry of Finance (2013); 2. Higher Education Science and Technology statistics from the Ministry of Education (2013); 
3. China’s Ministry of Finance (2013); 4. Sun, Y. & Cao C. Science 345, 1006-1008 (2014). 
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2014 International Symposium on Research Assessment and Evaluation (RAE) 
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RAE and National Innovative Competitiveness 
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A panel comprising (L-R) Philip Campbell, Shen Wenqing, David Sweeney, Daniel Hook, Maki Kawai and Li Xiaoxuan discuss China’s research evaluation. 


SYMPOSIUM OVERVIEW 


Raising standards 


Growth in the science budget and in research output have been China’s key attributes over 
the past few decades, but the focus is now on how to boost research quality. 


BY MICHELLE GRAYSON 


( ountries the world over are looking to 
their research base to drive progress 
in an increasingly competitive envi- 

ronment. But how should research be guided 

or rewarded to achieve a country’s goals — 
be they academic excellence, as exemplified 
by breakthrough papers and awards such 
as the Nobel prize, or measured in terms of 
economic prosperity and social wellbeing? 
“Assessment and evaluation of research 
is a timely topic globally,” said Charlotte 

Liu, regional managing director for science 

(Greater China) and education (Asia) at 

Macmillan Science and Education, the par- 

ent company of Nature Publishing Group, 

when she opened the 2014 International 

Symposium on Research Assessment and 

Evaluation in Shanghai, China. “Crucial 

to this aim is to establish a comprehensive, 

rational and systematic evaluation frame- 
work to guide research investment and 
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resource evaluation.” At the symposium, held 
in October at the Shanghai Association for 
Science and Technology (SAST), repre- 
sentatives from academia, industry and 
government convened to talk about their 
experiences and to discuss courses of action. 


GROWING AND LEADING 

In his introduction, Yang Jianrong, vice- 
chair of SAST, outlined important elements 
that any research evaluation plan must 
address. The first, he said, was a “focus on 
the quality of research benefits, including 
the need for objective criteria”. He stressed 
that, in a world plagued by resource short- 
ages and environmental degradation, it is 
important to target innovations that are 
“beneficial to the planet’s long-term stability 
and sustainable green development”. 

The pages of Nature reflect the substantial 
growth of Chinese science in recent years, 
said the journal’s editor-in-chief, Philip 
Campbell. The ultimate aims of research 


© 2015 Macmillan Publishers Limited. All rights reserved 


assessment, he said, were “to incentivize 
good practice and critical and ambitious 
thinking”. Research has an effect beyond 
the institution or even the community in 
which it takes place, with the potential to 
influence many lives through new technol- 
ogy, new governmental policies and better 
health care. Therefore, said Campbell, it is 
crucial to “capture the impact of research in 
the fullest sense — from both academia and 
the wider research community”. 

Any evaluation of research needs to con- 
sider the interlocking chain of basic research, 
applied research, technology development 
and commercialization, said Zhang Xu, vice- 
president of the Shanghai branch of the Chi- 
nese Academy of Sciences (CAS). “A good 
system is not only 


conducive to the DNATURE.COM 
development of sci- _Toread the Nature Index 
ence and technol- 2014 China supplement, 


free online, visit: 
go.nature.com/yktnmy 


ogy, Zhang said, 
“but it will help our 
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COURTESY OF KURT WUTHRICH 


scientists grow, our education to improve, and 
help to create benefits for all of mankind and 
particularly for our environment.” 


PROMOTING THE UNDERACHIEVERS 

Kurt Wiithrich, a biophysicist dividing his 
time between the Scripps Research Institute in 
San Diego, California, the Swiss Federal Insti- 
tute of Technology in Ziirich, Switzerland, and 
the iHuman Institute at ShanghaiTech Univer- 
sity, talked about the way in which scientists 
are evaluated. Withrich is best known for 
sharing the 2002 Nobel Prize in Chemistry 
for the development of nuclear magnetic 
resonance spectroscopy. However, during his 
undergraduate years at the University of Bern 
he was more successful in his athletic endeav- 
ours. He reflected on his track speciality in the 
conference’s plenary address: “For the high 
jump, one needs to find children with talent 
and simply measure their achievement. Each 
attempt gives a clear and final result; good or 
bad.” But such obvious decision-making is 
not available to science, said Wiithrich. “The 
result of research, whether it is impact on 
quality of life or economics, may not appear 
for years or even decades.” 

The appointment and promotion of sci- 
entists, Wiithrich continued, is often under- 
emphasized in discussions about research 
assessment. “If we do not select these sci- 
entists well, then we will not get good value 
out” Withrich contends that there are many 
“talented underachievers” who might need 
encouragement to make better use of their 
skills, and who are usually overlooked in 
favour of people whose main ability lies in 
maintaining the status quo. These overachiev- 
ing scientists will not be the ones who produce 
the breakthroughs, he argued (see page S13). 

His message is that, rather than examin- 
ing researchers’ past output, more emphasis 
should be placed on finding, supporting and 
retaining the exceptional people. “There are 
very few who push things forward,” he said. 


IMPACT IN THE UNITED KINGDOM 

David Sweeney provided the counterpoint to 
Wiithrich’s experience — that is, from the per- 
spective of an assessing organization. Sweeney 
is a director of the Higher Education Funding 
Council for England (HEFCE), which evalu- 
ates research across the United Kingdom. The 
first task, said Sweeney, is to answer the ques- 
tion: “Why are you assessing?” 

For HEFCE, the answer to that question 
is clear: the evaluations guide the alloca- 
tion of funding. Most research in the United 
Kingdom is undertaken at universities, who 
then choose what projects and researchers to 
support. “We expect universities to take wise 
decisions,” he explained. 

But there is also another crucial question, 
Sweeney said: “What does research suc- 
cess look like — that is, what are you trying 
to achieve?” And here, he said, the answer 


is very much in flux. The economy of the 
United Kingdom, as in much of Europe, is 
struggling. “The government sees our uni- 
versities and their research as one of the most 
successful systems in the world and a key part 
of the approach to returning the economy to 
balanced growth.” The United Kingdom has, 
he said, “intellectual leadership in the devel- 
opment of new knowledge”. But whether this 
knowledge has a positive impact on society 
has not been clear. “We assume it has, but do 
we have the evidence?” 


Kurt Withrich training in Switzerland in 1956. 


For more than two decades, the United 
Kingdom ran the Research Assessment 
Exercise (RAE) to evaluate the quality of 
university research. According to Sweeney, 
the RAE had a hugely positive effect on the 
number of publications and citations, and of 
the quality of this output. “We think we are 
spending our money wisely,’ said Sweeney. 
“We want to spend it even better” 

And better, in this context, means more 
‘impact’ — a much-used term at this 
symposium and in discussions of science 
evaluation in general (see page $21). Sweeney 
explained: “Research impact is the demonstra- 
ble contribution that research makes to the 
economy, society, culture, national security, 
health, public policy or services, quality of life, 
and to the environment”. 

For the 2014 assessment, the United 
Kingdom transformed the RAE into the 
Research Excellence Framework (REF) and 
awarded 20% of the evaluation score for an 
institution on the basis of case studies, which 
describe the wider (non-academic) impact 
of research. However, modifying the assess- 
ment system is not intended to change the 
fundamental focus of UK research. “We don't 
want to discourage curiosity-driven research, 
but instead to prove that the best impacts 
come from this type of research, he said. 


THE ACADEMY FOR CHANGE 

China, unlike the United Kingdom, has no 
national system for research evaluation. 
However, the assessment process operating 
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within CAS — one of the largest research 
organizations in the world — can be seen asa 
microcosm for the country, said Li Xiaoxuan, 
director of CAS’s Institute of Policy and Man- 
agement in Beijing. 

China is a newcomer to the international 
research arena. “It was only 30 years ago 
that we started to reform, said Li. “We were 
closed, but we started to open up to a market 
economy.’ At the same time, efforts began 
to modernize industry and science, and the 
National Natural Science Foundation of China 
(NSEC) was established in 1986 as the main 
body for competitive project funding. “But 
we didn't appreciate how we should guide 
research through evaluation and assessment,” 
he added. 

In 1990, the government began evaluating 
its national key laboratories, and shortly there- 
after CAS began evaluating its own research 
institutes. The purposes behind the evalu- 
ations, said Li, were: to help select the best 
people through competition; to raise China’s 
science and technology output, in line with 
international levels; to promote efficient 
resource allocation; and to bring scientific 
decision-making into the management of 
research and development (R&D). 

But there were problems with the tools 
developed for these tasks. “The biggest 
issue was that there was too much focus on 
quantity,” Li admitted. Scientists would focus 
on increasing the number of projects they 
conducted and how many articles they wrote. 
“Tt led to very short-term behaviour,’ said Li, 
with the knock-on effects of research miscon- 
duct and wasted funding. 

To provide more helpful incentives, CAS 
is now moving towards qualitative evalu- 
ation (see page S18). Since 2011, CAS has 
been using the One-Three-Five programme. 
The name stands for one orientation, three 

breakthroughs and five 


“A ssessment major directions. “It is 

andevaluation cused on outcomes, 

of researchis not papers said Li. 

a timely topic By ocusing its 
research institutes on 


» 
globally. the One-Three-Five 


plan, CAS is hoping 
to avoid duplication and create areas of spe- 
cialization, to help its institutes achieve major 
breakthroughs and maintain its fast pace of 
development. Results so far from the 19 insti- 
tutes evaluated show that around one quar- 
ter can be considered world-leading — the 
highest determination, said Li. One of the 
core features of the plan is that is uses third- 
party assessment, including evaluation from 
international experts. Feedback from such 
experts should quickly identify any problems 
in management and provide constructive sug- 
gestions. And, as with the United Kingdom's 
research evaluation scheme, the plan “enables 
us to keep a balance between basic science and 
applied research’, Li said. 
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Wang Minmin, head of chemistry at Eli Lilly’s Shanghai-based China research and development centre. 


The afternoon of the symposium comprised 
panel discussions on the broader implications 
of research evaluation. One of the recurrent 
themes was how best to support and evaluate 
young scientists. 

“Evaluating young people is different from 
evaluating older, established scientists,” said 
Li Mengfeng, vice-president of Sun-yat Sen 
University (SYSU) in Guangdong. “We need 
to look for potential rather than looking at 
past achievement.” 

Shen Wenqing, an academician at CAS, 
observed that the pervasive conservatism 
of Chinese society tended to “kill advanced 
ideas” — ones that have a low probability of 
success but potentially high impact. Echoing 
Wiithrich’s earlier talk, Shen urged a greater 
focus on encouraging young scientists to 
pursue original ideas. 

These aims are also 


£ ee “We need 
reeen Jap = said to look for 
Maki Kawai, executive . erie ae 
director of RIKEN, : 
looking at past 


Japan’s largest dedi- 
cated research institu- 
tion. Young scientists 
are overburdened by Japan’s onerous annual 
evaluation system, she said. Moreover, the 
current system does not take diversity into 
account. “We should not choose just one sort 
of person,” she said, “it is important to have 
flexibility in the system” 

CAS’s Zhang raised the issue of how best 
to evaluate genuine breakthrough ideas. He 
called on the research community to help 
develop a new way to determine a paper’s 
quality, one that is not entirely dependent on 
the journal in which it is published, or on how 
many times it has been cited. Such a broaden- 
ing of criteria would be particularly helpful to 
researchers at the start of their career. “At the 
very beginning, it is hard to publish in high- 
impact journals,” he said. 

Anthony Cheetham, vice-president of the 
United Kingdom’s Royal Society, took up 


achievement.” 
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Zhang’s point. He noted that “many papers 
that have recently won Nobel prizes were 
actually published in second-tier journals”. 
Having papers in these journals only should 
not be an impediment to promotion, he said 
(see page S34). 

Participants started to reach a consensus 
on this point. Wang Xiao-Jing, associate vice- 
chancellor for research at New York University 
Shanghai, added that “these high-impact jour- 
nals can inform assessment but should not be 
all of it”. When interviewing someone for a 
position, Wang Xiao-Jing advocates “spend- 
ing a day reading all their work”. 

Providing a view from industry was Cory 
Williams, head of clinical-trial management 
for Pfizer’s Shanghai R&D centre, part of 
the New York-based pharmaceutical firm’s 
global network. He spoke about the three 
leading indicators that he looks for in young 
researchers. The first two concern uniqueness 
of research and productivity, but the third is 
perhaps the most important: capacity for col- 
laborative research. “Through collaboration 
you can reach new skill sets,” he said. “These 
interactions lead to more innovation” 

Williams’s advice is to “reverse engineer 
what a distinctive researcher looks like at vari- 
ous points in their career”; then it will be pos- 
sible to measure and mentor people towards 
that template from the beginning. 


SOCIETAL IMPACT 
Another emergent theme was on the chal- 
lenges in measuring and understanding the 
impact of all the diverse products of research 
for society. Campbell gave examples that illus- 
trate how high-impact research is not always 
published in high-impact journals. “Multidis- 
ciplinary teams of natural and social scientists 
creating solutions for water-stressed cities” 
is hugely important, despite rarely being 
published in the leading journals. 
Assessment of science is of critical impor- 
tance to research-funding agencies. According 
to Chu Junhao a physicist at CAS’s Shanghai 
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Institute of Technical Physics, there are four 
factors that can help funders to determine the 
societal value of projects: whether the research 
led to new knowledge; to industrial output; to 
enhanced technical proficiency; or to develop- 
ment of new expertise. 

For some areas of research, the main impact 
will be in terms of government policy. Lu 
Yonglong, an environmental scientist at CAS’s 
Research Center for Eco-Environmental 
Sciences in Beijing, spoke about his speciality. 
“For us, the point is not to just have papers 
in top journals, but to ensure that the public 
understand what the issues are.” 

A key element in public understanding 
of science is accessibility of research papers. 
“We are more likely to have solutions if 
everyone has access to the relevant informa- 
tion,’ said Carrie Calder, strategy director for 
open research at Nature Publishing Group. 
The value of open access is particularly 
important when it comes to solving global 
challenges such as food security, pollution 
and climate change, which require cross- 
disciplinary collaboration. 


SEEDING NEW COMPANIES 

The final headline topic grappled with how 
to move from research to the creation of new 
companies. Guo Chongqing, a mechanical 
engineer at Tongji University in Shanghai, 
provided a perspective from his four decades 
at the forefront of engineering design. Guo 
said that China has come a long way in the 
past few decades, but that R&D still has its 
problems. In particular, he said, “there is 
too much meddling and interference from 
government”. 

But there are good signs in certain sectors. 
“There is a lot of enthusiasm and drive in Chi- 
nese universities to contribute to drug discov- 
ery,’ said Wang Minmin, head of chemistry 
at the Shanghai R&D centre of Eli Lilly, a US 
pharmaceutical company based in Indianap- 
olis. But key to this field is the ability to learn 
from failure. “We should emphasize pub- 
lication and sharing of these stories,” Wang 
Minmin said. At Eli Lilly, she explained, they 
have “started to celebrate any outcome, posi- 
tive or negative, to allow people to openly say 
what they’ve learned”. Such a policy helps to 
share information and prevent duplication of 
efforts. Failure, in this context, is also success. 

The symposium closed with some reflec- 
tions on the day’s discussions from Chen 
Kaixian, chair of host organization, SAST. 
This meeting, involving participants from 
different scientific evaluation systems, 
“should help promote understanding of each 
other, through the blending and collision of 
ideas”, said Chen. As delegates prepared to 
return home to their companies and labora- 
tories, there was a strong urge to take some 
of the lessons back with them to have an 
impact on how science is assessed around 
the world. = 
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Q&A Kurt Withrich 
Science regeneration 


Nobel laureate Kurt Wiithrich is a Swiss biophysicist who works at the Scripps Research Institute 
in La Jolla, California, and the Swiss Federal Institute of Technology in Ziirich. In 2013, he was 
appointed as a professor at ShanghaiTech in China, where he is helping to build the institution's 
science faculty. 


What took you to China? 

My first trip to the country was in 1983. I was 
secretary general of the IUPAB [International 
Union of Pure and Applied Biophysics]. Dur- 
ing those years, mainland China was just 
beginning to emerge from decades of isola- 
tion. In the 1970s, there was a historic ping- 
pong match between US and Chinese athletes 
to promote diplomacy, and similar things were 
happening in science. 

In the IUPAB and other scientific unions, 
China was represented by Taiwan; mainland 
China was isolated. We changed the statutes 
of the IUPAB so that we could accept two 
representations for China — one from Taipei 
and one from Beijing. That way, mainland 
China could rejoin the international scientific 
community. This was a very lengthy process 
and at times frustrating, but eventually it was 
ahighly rewarding affair. 


What was the scientific community in China 
like when you first visited? 

There were high points in Chinese science 
already: at the Shanghai Institute of Biochem- 
istry and Cell Biology, researchers had deter- 
mined the structure of insulin and had also 
sequenced a transfer RNA molecule. Those 
were incredible feats. But the scientists who 
were in charge in 1983 had barely survived the 


Cultural Revolution, including physical abuse. 
They went through terrible times. There were 
very few who published good work. 


How does China’s political history affect the 
country’s science research? 

China lacks two generations of scientists, 
who should have been studying during the 
Cultural Revolution. So although they 
reopened the universities that had been 
closed, they didn’t have qualified individuals 
to fill the positions. 


Even today, there are “It is very rare 
thousands of poorly to findnew 
qualified professors in Chinese PhDs 
universities and col- whowantto 
leges. This is, of course, doapostdocin 
avery difficult situation, China — they 
and there aresome pro- all wantto go 
grammes toaddressthe q@broad.” 


resultant problems. For 

instance, only a small percentage of professors 
in Chinese universities are empowered to 
advise graduate students. 


Many new colleges and universities have been 
founded in China in the past ten years. How 
are they progressing? 

Many have no high-level research faculty. In 
contrast, I have witnessed the construction 
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of an entirely new campus for ShanghaiTech 
University, where I am involved in hiring 
scientists. We try to employ outstanding 
scientists who are originally from China, most 
of them are in the United States. We also try 
to hire non-Chinese scientists. We can offer 
start-up packages that are much better than 
those in the United States, yet it is still very 
difficult to attract top-level scientists. Further- 
more, it is very rare to find new Chinese PhDs 
who want to do a postdoc in China — they all 
want to go abroad. That’s a problem when you 
are trying to recruit good scientists to build a 
research team. There is also uncertainty about 
howlong the present ‘good life’ in China, with 
all of its support for scientists, will last. 


How should Chinese institutions evaluate their 
faculty members? 
What needs to be done is to establish an evalu- 
ation process that is based on the quality of 
science. And this is a difficult thing. Institu- 
tions are at a very early stage of introducing 
proper peer review. Currently they place great 
importance on publication records. Accept- 
ance of papers in international journals, based 
on peer review by scientists outside China, is 
taken as a very positive asset. Because there 
is no functioning evaluation system within 
China, the importance given to publication in 
high-impact journals is enormous. Another 
problem is that, overall, productivity in China 
is very high but quality is not. Publication 
numbers go up, but citation records don't nec- 
essarily follow. The solution is strict reviewing 
of faculty by international experts in science. 
China must open its doors to review its 
system, with help from the international 
community. Many Chinese faculty want to 
open up, but a lot of power in China is with 
the administrators, who often don't write 
or speak English. Most of the world’s top 
scientific journals are in English. 


What can Chinese universities do to train 
better scientists? 
The first recommendation is that all teaching 
in the natural sciences should be in English. 
The second is that the most highly talented 
candidates should be sent abroad, with an eye 
to recruiting them back as faculty in China. 
The third is the most difficult one. That is 
to take risks and to lavishly support a small 
number of individuals who promise to do 
something really new. Scientists who are 
doing truly original work often don’t have a 
great citation record and don't get published 
in the leading journals, because it’s hard to 
find someone to evaluate the work. Therefore, 
the individuals in charge must take time to 
meet with candidates and have a good sense 
of what high quality means. Keeping this 
in mind, science education in China looks 
forward to a bright future. = 
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COLLABORATIVE EFFORTS OF CHEMISTRY 
FOR ENERGY MATERIALS 


o achieve breakthroughs in en- 

ergy science and technology, the 

Collaborative Innovation Center 
of Chemistry for Energy Materials 
(CHEM) was jointly established in 
September 2012 by Xiamen University 
(XMU), Fudan University (FDU), the 
University of Science and Technology 
of China (USTC), and Dalian Institute of 
Chemical Physics (DICP) of the Chinese 
Academy of Sciences (CAS) under 
the framework of the Collaborative 
Innovation Plan of the Ministries of 
Education and Finance of China, the so- 
called 2011-Plan. 

The departments at XMU, FDU and 
USTC are among the top chemistry 
departments in China, while DICP is re- 
nowned for its leading role in catalysis 
and energy-related research in China. 
Such collaboration across multiple 
disciplines, between different depart- 
ments and major research laboratories 


in universities, research institutes and 
industries is necessary to facilitate 
world-class research. 


iChEM'’s highly efficient 
coordination 

iChEM proposed and established a mul- 
tidimensional and innovative “iChEM 
Efficiency Enhancement System” to 
encourage flow and collaboration of 
members. This forms a comprehensive 
innovative culture and mechanism for 
remote multidisciplinary collaboration. 


The system includes the CHEM Scholars 
system, the iCHEM Sub-Platforms and the 
iChEM All Member Service system. 

The iChEM Scholars system encourages 
outstanding iChEM talents to interact 
with researchers of all levels at multiple 
iChEM research departments. This attracts 
more talents to iChEM and significantly 
increases collaboration among multiple 
units, platforms and mentors — leading 
to an overall increase of research quality 
and greater efficiency in human resources. 
The iChEM Sub-Platforms make use of 
the unique academic and physical resourc- 
es of each collaborative unit to construct 
distinctive research platforms. The sub- 
platforms amass complementary resourc- 
es both from iChEM and externally and 
allocate them throughout iChEM's multiple 
units and laboratories. Researchers from 
different units are able to conduct collab- 
orative research on the sub-platforms to 
achieve a common research goal. 


Ss 


Attracting talented researchers 

As the leader in energy chemistry in- 
novation in China, iChEM holds com- 
plete autonomy on talent recruitment, 
position setup and student admissions 
quota. iChEM drafts its own recruitment 
standards, position goals and missions 
and recruitment protocols. Talented re- 
searchers are recruited worldwide and 
managed via the ‘iChEM Scholars’ system. 
iChEM provides a world-level research 
platform, comfortable apartments and 


internationally competitive stipends or 
salary for CHEM Scholars in order to attract 
expert scientists. CHEM scholars include 
iChEM Outstanding Professors selected 
among members of Academies of Sciences 
and awardees from the Thousand Talents 
Plan, CHEM professors selected among full 
and associate professors from renowned 
universities and research institutes, (CHEM 
research assistants selected from outstand- 
ing postdoctoral fellows, and iChEM post- 
doctoral fellows selected from individuals 
who have obtained their doctoral degree 
within the past three years. CHEM also at- 
tracts and selects outstanding candidates 
for doctoral study with excellent mentor 
teams, challenging research projects, and 
generous scholarships. 

iChEM has recruited a number of 
well-known experts in chemistry and 
materials for energy, including ten CAS 
members, nine Thousand Talents Plan 
awardees, 39 National Distinguished 
Young Scientist Grant awardees, and 11 
Cheung Kong scholars. 
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Visit: http://www.2011-ichem.org/en/ 
Email: 2011-ichemfaxmu.edu.cn 
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stablished in 2012, the Collaborative 


Innovation Center of Chemical 

Science and Engineering (Tianjin) 
was approved by the Ministry of Education 
of China in April 2013 to become one of 
14 centres in an initiative to promote in- 
novation in research and development 
and interdisciplinary studies at colleges 
and universities. 


The School of Chemical Engineering 


and Technology of Tianjin University and 


the Chemistry Department at Nankai 
University are highly regarded in China. 
They are located close to each other and 


have a long tradition of conducting inter- 
disciplinary research and education. The 
centre is founded on collaboration be- 
tween the School of Chemical Engineering 
and Technology at Tianjin University, the 
College of Chemistry at Nankai University, 
the Institute of Process Engineering of the 


Chinese Academy of Sciences, the Sinopec 
Group (a Fortune Global 500 company) 
and the Tianjin Bohai Chemical Industry 
Group. The centre conducts research in 
the areas of design and green synthesis of 
functional materials and highly efficient, 
clean conversion and utilization of energy 
and resources. It performs both fundamen- 
tal and applied research to solve key issues 
in the fields of energy, resources, environ- 
ment and healthcare. 

The Collaborative Innovation Center 
of Chemical Science and Engineering 
(Tianjin) has gathered many world-lead- 
ing scientists in the fields of chemistry 
and chemical engineering. In the area 
of design and green synthesis of func- 
tional materials, these scientists mainly 
focus on studying structural effects and 
modulating the properties of functional 
molecules as well as developing new, 
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highly efficient green reactions, reagents 
and catalytic systems. Researchers a 
the centre also investigate the catalytic 
modes, chemical reaction rules, the ef- 
ficiency and selectivity of the creation 
and/or conversion of chiral materials 
and the precise and perfect synthesis of 
chiral drugs and materials. Furthermore, 
they study the design, synthesis and ap- 
plication of artificial biosystems, which 
are associated with synthetic biology, 


genomic and pesticide chemical biology 
and organic synthetic chemistry of bioac- 
ive molecules. 

In the areas of conversion and utiliza- 
ion of energy and resources, the centre 
mainly focuses on studying catalytic the- 
ory and engineering technology related 
o highly efficient and clean conversion 
of coal, synthesis gas and carbon dioxide. 
For the development of new energy and 
resources, it focuses on studying design 
heory and engineering technology of 


high-efficient photoelectric conversion 
materials and devices, and highly eff- 
cient light-catalysed reactions. 


Visit: www.tju.edu.cn/2011/english/ 
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Information for postdoctoral applications for the 
Collaborative Innovation Center of Chemical Science 
and Engineering (Tianjin) 


The Collaborative Innovation Center of Chemical Science and Engineering (Tianjin) invites postdoctoral candidates 
from around the world to join its team. 

The centre will provide a good working environment, research and development facilities, as well as professors for 
supervising projects. The total annual salary will be up to 250,000 RMB. An extra bonus will be awarded to exceptional 
candidates, who will be recommended for appointment as a staff member at Tianjin University or Nankai University. 

Candidates should possess a doctoral degree from a well-known university or institution (either in China or 
overseas), good physical and psychological health, and an excellent scientific research track record. They should be a 
team player with an innovative spirit. 

Candidates should have a research background in functional molecule synthesis, synthetic biology, catalytic science, 
highly efficient and clean conversion of coal, synthesis gas and carbon dioxide, clean energy, synthesis and application 
of chiral molecules, high-efficiency photoelectric conversion materials and devices, high-energy fuel, or other areas in 
chemistry and chemical engineering. Outstanding candidates will receive competitive advantages. 

When applying, please provide a resume (which should indicate the level of scientific research conducted, present 
and potential ability, and a self-assessment), a proposal for research to be conducted at the centre, two or three letters 
of recommendation (including one from a doctoral or postdoctoral supervisor), a document outlining representative 


research achievements and scanned copies of proof of identity and degree certificate. 


Please send an email with the subject “Graduate school [name of major-research 


direction]” with the above materials attached to hxhg@tju.edu.cn. 


For more information, please visit http://hxhgxt.tju.edu.cn. We look forward to hearing 


from you. 
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Liu Zhongmin (front row, third from right) receives the prestigious First Class State Technological Invention Award from President Xi Jinping in January 2015. 


Moving away from metrics 


The Chinese Academy of Sciences is changing the way it assesses its research. But finding 
the right balance in such a big organization is a tough job. 


BY HUANG KUN 


President Xi,’ says Liu Zhongmin, a 
chemist at the Chinese Academy of 
Sciences (CAS), a country-wide network of 
institutions with headquarters in Beijing. “It’s 
a glorious moment. It is the symbol of social 
recognition of our research, on which my 
colleagues and I spent more than 30 years.” 
Liu is referring to the First Class State 
Technological Invention Award that he 
received from China's President Xi Jin- 
ping at China’s annual National Science and 
Technology Awards Conference on 9 January 
2015. His award-winning invention is dimethyl 
ether/methanol to olefins (DMTO) technol- 
ogy, which allows the chemical industry to 
substitute coal for oil in many applications. 
That is profoundly important for China, which 
is poor in oil but rich in coal. Liu and his CAS 


C Cc | = thought it would be awarded by 
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colleagues invented the process and pioneered 
its use in industry. The total production capac- 
ity for olefins from 7 DMTO plants in China 
reached 4 million tonnes per year in 2014. 

Encouraging researchers such as Liu to 
make breakthroughs like DMTO technology 
is the goal of CAS’s new assessment system. 
“That is a good model for constructing the 
major outcome-oriented system for evalu- 
ation at CAS? says Li Xiaoxuan, director of 
the Management Innovation and Evaluation 
Research Center at CAS. The centre, based in 
Beijing, is the academy’s official think tank for 
research assessment. 

And CAS is big. According to its lat- 
est report, CAS has 56,000 researchers 
and technicians, plus another 12,000 non- 
technical staff. It consists of 104 research 
institutes across China, as well as universi- 
ties, companies and other entities. In 2013, 
through a combination of government 
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funding and other sources, CAS had budget of 
41.9 billion yuan (US$6.8 billion), most of 
which was spent on research and development 
(R&D) (see page S8). 

In the past, CAS has evaluated its research 
output with quantitative metrics and rankings. 
But now it is changing the way it operates to 
give a more nuanced appreciation of progress. 


GIVING IT BACKBONE 
The core of the current evaluation system at 
CAS is known as the One-Three-Five plan. “It’s 
difficult to have a panorama of everything in 
this big organization,’ Li says. “But now many 
things can be connected to this backbone.” 
One-Three-Five is a guide to the way that 
each CAS institute should operate. That is, 
each institute should stake out for itself one 
major research area where it positions its core 
competencies, set out to make three major 
breakthroughs in the next five to ten years and 


DALIAN INSTITUTE OF CHEMICAL PHYSICS, CAS 


DALIAN INSTITUTE OF CHEMICAL PHYSICS, CAS 


map out five specific research priorities that 
will provide competitive advantage. 

Reforms that led to the One-Three-Five 
plan began in 2011, when chemist Bai Chunli 
started his term as president of CAS. He told 
Nature at the time that his plan was to “reduce 
the frequency of research evaluation while 
improving its quality” (see go.nature.com/ 
gajb 1x). Moreover, he said, research would be 
judged “based on national needs and socio- 
economic benefits”. 

The following year, Bai expanded on his 
ideas in an article in the Bulletin of the Chi- 
nese Academy of Sciences (see go.nature.com/ 
xgnkde). In the article, Bai argued that as the 
growth of China’s economy slowed from the 
frantic pace of the past two decades, it was cru- 
cial to also shift the focus of research towards 
innovation. Thus, the evaluation system at 
CAS should encourage researchers to solve 
scientific problems, create new research fields 
and make major technological advances. 


FROM QUANTITY TO QUALITY 

One of the main differences between the 
current and past evaluation systems at CAS 
is the extent to which quantitative measure- 
ments are used. For much of the past decade, 
institutes were ranked by combining a number 
of indicators, each with a different weighting, 
in various complicated formulae. There were 
24 basic indicators, including the number of 
publications and citations, amount of funds 
raised, awards gained and patents issued. 

This heavily quantitative assessment 
approach led to what Li calls the Science 
Citation Index (SCI) phenomenon — an 
abnormal emphasis on increasing the number 
of papers published in journals in the SCI, even 
if the impact of such research on industry and 
society was low. Li believes that the new evalu- 
ation system will be more effective at aligning 
research incentives with the country’s overall 
needs. “The shift of emphasis to qualitative 
assessment will encourage major outcomes 
that really matter,” he says. 

The quantitative system had winners and 
losers. “Our institute always ranked at the 
top,” recalls Liu proudly. He joined CAS’s 
Dalian Institute of Chemical Physics (DICP) 
in Liaoning province as a postgraduate in 
the 1980s. Three decades later, he is now the 
deputy director of DICP. 

Liu believes that DICP is still one of the top 
five performing institutes at CAS, despite the 
lack of quantitative ranking. Although the 
basic indicators are still collected, the num- 
bers are no longer used to calculate a rank 
order. Instead, each institute submits its data 
to CAS headquarters, which sends back a 
report. “That’s very useful for us to analyse 
variations and so adjust our work,’ Liu says. 
These indicators now form the foundation of 
the One-Three-Five plan, and assessment is in 
the form of two sections. The first section is 
an expert assessment — a form of peer review 
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A dimethyl ether/methanol to olefins complex in the Shenhua Baotou plant in Inner Mongolia. 


— to check on progress towards the strategic 
goal, which takes place once every five years. 

The DICP’s expert assessment was 
conducted in 2013. The experts recognized 
the DMTO technology as one of the insti- 
tute’s major breakthroughs, which qualified 
the project for additional funding from CAS. 
“And we have many research areas that were 
judged to be in the first tier in the world? Liu 
adds. First tier being the equivalent of ‘world- 
leading’ research; there are four tiers for rating 
research from an international perspective. 

From 2012 through 2013, CAS con- 
ducted international expert assessment of 16 
research institutes, including DICP. Experts 
invited ranged from journal editors to chairs 
of international societies and associations. 
They assessed 90 research areas within the 16 
institutes, judging 26 to be in the first tier and 
53 to be in the second. 


INSIDE INSTITUTES 

To supplement these overarching evaluations, 
each CAS institute performs its own assess- 
ment of its research groups and individuals. 
The institutes aim to stimulate competition 
and ensure that the best researchers are 
retained. 

DICP is one of the biggest institutes at CAS 
and employs more than 1,000 staff in around 
80 research groups. DICP has been performing 
its own internal evaluation system for many 
years. In the past, DICP’s indicators included 
quantitative elements, but now, just as CAS is 
reforming to be more qualitative, DICP is also 
moving in that direction. 

“We have cut some indicators and now 
are using four basic criteria: the clearness of 
research direction; the closeness to our One- 
Three-Five strategy; the progress of their work; 
and the potential for future development,’ Liu 
explained. “Qualitative judgement based on 
these criteria comes from an academic com- 
mittee — in essence, a peer review.’ And these 
internal assessments can have high stakes. 
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DICP, for example, evaluates and ranks all of its 
research groups every two years. “The bottom 

5% are eliminated,” says Liu. 
Not all CAS institutes are moving as quickly 
or decisively as DICP. In north Beijing, near 
the stadium that hosted 


“The shift of the 2008 Olympic 
emphasis to Games, there is a clus- 
qualitative ter of 12 CAS institutes. 
assessment a see inom hana 
willencourage SDS WHO Works mone 

of the institutes, but 
outcomes that 


who wishes to remain 
anonymous for fear 
of repercussions for 
speaking out. He is an associate researcher in 
his thirties and is one of an estimated 14,000 
‘middle-class’ professionals at CAS. He says 
that despite these recent changes at the top 
level of CAS, his evaluation — and that of the 
many others like him — still relies heavily on 
the old system of indicators. 

“I have received money according to the 
number of papers I published, with additional 
money for each citation of the paper,’ he says. 
This kind of quantitative reward system is 
still very much in existence in many insti- 
tutes, he adds, they vary only in the details. 
“Papers bring us monetary rewards, which are 
given at the end of each year and are typically 
equivalent to three or four months salary,” he 
says. “I have known some highly cited papers 
that brought the authors rewards worth more 
than a whole year’s salary — or even of several 
years’ salaries.” 

These indicator numbers are also important 
references for career advancement within CAS. 
Papers, citations and funds received are con- 
sidered ‘hard assets’ to show to the academic 
committees that evaluate individuals for pro- 
motion. Chinese society values relationships 
— knowing the right people is important and 
the academic circle is no exception. Therefore, 
relying on hard assets is one way to make sure 
that evaluations are not unduly influenced by 


really matter.” 
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scientists’ friendship circles. So there is certain 
rationality in utilizing these numbers, leaving 
the balance between quantitative and qualita- 
tive methods to be a delicate issue. 

“Inviting foreign experts to participate in 
the assessment section of the One-Three-Five 
plan is a good attempt to solve this issue,’ says 
Mao Shude, an astrophysicist at the National 
Astronomical Observatories of the Chi- 
nese Academy of Sciences (NAOC), based 
in Beijing, who spent 22 years studying and 
working in the United States and Europe. Mao 
explains that a group of high-profile inter- 
national experts will provide a more objec- 
tive evaluation as they have fewer conflicts 
of interest or established relationships than 
domestic ones. 


ROOM FOR IMPROVEMENT 

Mao, who recently took a joint position at 
Tsinghua University in Beijing, says that the 
reform of the evaluation system at CAS is “on 
the right course”. But compared with similar 
processes at international institutions, he says, 
there is still room for improvement. 

“At foreign universities, the contribution of 
a person consists of three parts,” only one of 
which is their research, says Mao. The second 
part is teaching, which has been largely absent 
at CAS but is becoming more important since 
the University of the Chinese Academy of 
Sciences started recruiting undergraduates 
in 2014. The third part, Mao says, is scientific 
cooperation and the research environment. 
“Groups within CAS institutes tend to operate 
separately, which is not conducive for an 
interactive and cohesive atmosphere.” 

Mao returned to China in 2010 as part of 
the national ‘1,000 Talent Plan, which recruits 
high-level experts — both Chinese nationals 
and foreign professors — from overseas and 
offers them well-funded academic positions 
to carry out their research in China. But, he 
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China’s President Xi Jinping (third from left) visits the Chinese Academy of Sciences in Beijing in 2013. 
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says, opportunities at Chinese institutions 
should be based on academic merit, not just 
experience gained overseas. “Although I ama 
beneficiary of this system, it’s not fair for some 
domestic researchers who did similar or even 
higher-quality work but were not recognized 
as such.” This situation is changing, how- 
ever, as the Chinese 


government has just “Groups within 
started a similar tal- CAS institutes 
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than relocating over- 
seas, notes Mao. 

Another area for 
improvement is the way in which the evalu- 
ation considers the specific needs of female 
researchers, says Mao. He notes that the 
percentage of females in permanent positions 
in his division, from assistant researcher to 
the highest levels, is just below 15%. “I have 
two daughters and I am very sensitive about 
gender equality,” he says. “It should be taken 
into account that women often bear additional 
family responsibilities.” 

In the Gravitational Lensing and Galaxy 
group at NAOC, where there are more than 
a dozen researchers, Wang Yuting is the only 
woman. Although she feels she is treated as 
equal to her male colleagues, she is aware that 
most female postdoctorates like her are at an 
age at which they feel they have to choose 
between research and family life. “Some 
female scientists did give up research because 
of family and kids.” 

According to the United Nations Educa- 
tional, Scientific and Cultural Organization, 
30% of the world’s science researchers are 
women. Although specific data are not 
available, the situation is broadly similar in 


atmosphere.” 
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China and is more acute in the more sen- 
ior positions. In 2011, the government set a 
target of increasing the percentage of women 
in all high-level professional positions to 
35%, including a special section devoted to 
improving the number of women as career 
scientists. For this to be achievable, the 
evaluation system would need to be more 
flexible, as it is in many Western countries, 
to allow for a career break and support when 
women return after childbirth, says Mao. 


MORE LEADS 

With regards to determining the trajectory 
of CAS, the most important voice is that of 
China’s President Xi. The president visited 
CAS in July 2013 and in a speech suggested 
that CAS should aim for “four leads”: the 
development of technology; nurturing the 
best minds; contributing to science and 
technology policy; and becoming a first-class 
international research organization. 

In 2014, based on Xi’s instructions, CAS 
started a project called Lead Action. It 
proposed that its institutes be divided into 
four categories, each with different evalua- 
tion systems based on their missions. These 
are: institutes that aim to meet the needs of 
national strategy and industrial development 
should be evaluated by the people and organ- 
izations who use the technologies; institutes 
that focus on academic achievements should 
be evaluated by an international academic 
peer group; institutes with big scientific 
facilities should be evaluated by users and 
colleagues; and other institutes with subject 
specialities should be evaluated by experts in 
the same field. 

Lead Action is gathering momen- 
tum. In February 2015, CAS released its 
latest guidelines for the Academy (the last 
revision was in 2002). Alongside the four leads 
are three orientations to ensure that CAS is 
matching its output with global science and 
technology efforts, the major needs of the 
country; and the “main battlefields of the 
national economy”. 

The outlook of the evaluation system at CAS 
is evolving. There are still issues to address in 
the way that different groups are treated, in 
ensuring that the various institutes match the 
overall strategy, and in finding the balance of 
qualitative and quantitative measurements. 
But the direction is clear. As Liu explains, 
flaws in the former evaluation system gave 
unclear signals to scientists as to what was the 
most valuable outcome. “There were multiple 
batons that pointed in different directions,’ he 
says. Now, he believes that the modified assess- 
ment system has brightly illuminated the goal: 
“Chinese scientists should do things that are 
useful to China first of all, then they could con- 
sider the rest of the world.” = 


Huang Kun is a science correspondent based 
in Beijing. 
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The numbers game 


David Sweeney, a director of the Higher Education Funding Council for England (HEFCE), 
outlines the importance of accurately assessing the benefits of academic research and the 


dividends it can bring. 


How do China and the United Kingdom differ 
in their methods of evaluating and funding 
research? 

For half of the higher-education assessment 
and funding in the United Kingdom, we don't 
make judgements about individual projects. 
We fund as a block grant without specifying 
what the money should be spent on. This 
means that a university can earn money 
because of its excellence in physics and spend 
it on drama. My understanding is that the 
Chinese approach involves more direction 
concerning the disciplines and institutions 
that the money will be spent on. 

The United Kingdom’s competitive research 
grant system is, of course, based on judgements, 
but those judgements are made solely by 
discipline experts. The only role for centralized 
planning by government is in the area of capital 
expenditure. Overall, the government and 
research funders cede the judgements to either 
discipline experts in the research councils or to 
university management. 


Should China take inspiration from the United 
Kingdom in its efforts to reform its research 
assessment and funding systems, or vice 
versa? 

The UK system is relatively mature. We've been 
conducting research in universities for a very 


long time, and we've been doing it in the way 
we do it now for at least 25 years. We are not 
on the whole trying to build research capacity. 
Our system serves us well and suits our cul- 
ture, but that doesn't necessarily mean it is right 
for another country. The real question is: should 
you do things differently in a research system 
that is in the process of establishing itself? At 
the 2014 International Symposium on Research 
Assessment and Evaluation, held in Shanghai 
and co-hosted by Nature (see page S8), I was 
very taken by some of the honest comments 
about how twentieth-century Chinese history 
affects the nation’s research structures, notably 
how the age cohorts of the researchers in China 
essentially reflect historical events such as the 
Cultural Revolution. We have nothing like that 
in our recent history that has changed the way 
we do things in the United Kingdom. It’s quite 
tricky to compare assessment and funding sys- 
tems when the environments are so different. 


What other countries have unusual ways of 
directing and funding their research efforts? 
One extreme example is Singapore, which 
has a highly planned and directed system. 
Singapore is seeking to build very strong 
research organizations and is offering signifi- 
cant incentives to encourage the best brains to 
go and work there. Another interesting case is 
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Australia, which targets its research resources 
in its strongest areas, such as environmental 
science, or those in which there is consider- 
able national need, which in the recent past has 
included production methods that can accom- 
modate competing demands for soil and water. 

You also have to make a judgement about 
whether one of the purposes of research is to 
support broader innovation and business. The 
United Kingdom is second only to Switzerland 
in the Global Innovation Index 2014 (an annual 
ranking of countries by innovation metrics). So 
depending on what they are trying to achieve, 
countries may need to look beyond their 
research funding systems, perhaps to associated 
ways to translate research into jobs and other 
economic benefits to achieve the biggest impact. 


Should other countries be worried that China 
has increased its research and development 
funding by an average of 23% per year for the 
past decade? 

The United Kingdom is fortunate enough to be 
extremely productive, which is partly because 
of the maturity of our system. If we want to 
maintain our current position, then we have 
to ask: ‘Are we are investing enough?’; and ‘If 
we maintain our current level of investment, 
will the best people be attracted to work else- 
where because of the greater investment that 
others are making?’ Our success is partly due 
to the people we attract from other countries, 
so yes, there is a threat posed by others having 
more funds for investment. We have to con- 
sider at every stage whether we have sufficient 
resources to attract and support the best people. 


To what extent is China’s rapid rise in high- 
quality research output a direct consequence 
of its sustained increases in funding? 

T’m sure that increased investment supports 
that and I’m also sure that there are other 
factors in play, but we don't yet have enough 
evidence to understand what those factors are. 
It would be helpful to understand not just the 
role of increased investment on results, but also 
how you get those results. We could all learn 
about using money more effectively. 


Should other countries follow China’s example 
by increasing research funding? 

You've got to do analysis on what the outputs of 
your research are, whether that is publications 
or new technologies. It’s always good to invest 
in developing new knowledge but there is 
competition for scarce funds from other areas 
such as education, health and defence. You can 
only determine whether the potential benefits 
are worth a proposed increase in investment 
based on the particular problems a coun- 
try is facing and on the productivity of your 
research system. Ifa country’s productivity is 
low, it might be difficult to justify additional 
investment in academic research. = 


INTERVIEW BY NIC FLEMING 
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DALIAN INSTITUTE OF CHEMICAL PHYSICS: 
A PROVEN RESEARCH CENTRE OF EXCELLENCE 


alian Institute of Chemical 
Physics (DICP) is located in the 
beautiful port city of Dalian, 
China. Since its founding in 1949, DICP 
has built up an impressive portfolio of 
achievements in the basic and applied 
sciences that have directly impacted the 


economic and technological develop- 
ment of China. Traditional areas of excel- 


lence at DICP include catalysis, chemical 
engineering, chemical lasers, molecular 
reaction dynamics, organic synthesis, 
modern chromatographic techniques 
and biotechnology. 

DICP is composed of eight laboratories, 
each having unique research approaches 
and imperatives. While having distinct 


and diverse capabilities, technologies 
and equipment, all of the laboratories 
work together in unison to accomplish 
important national directives. 

With over 1,000 staff members, 
950 graduate students and 100 post- 
doctoral scholars, the research efforts at 
DICP are extensive. Many outstanding 
scientists and engineers have chosen the 
institute as a home base where they can 
develop strong skills and enhance their 
domestic and international reputations. 
DICP currently boasts 17 members of 
he Chinese Academy of Sciences (CAS) 


and Chinese Academy of Engineering 
(CAE) and 3 members of the Academy of 
Sciences for the Developing World. 


Making an impact 
Each year, programmes at DICP are be- 
coming increasingly recognized globally, 
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and domestic and international collabo- 
rations are expanding. The quality of the 
research output at DICP is reflected in 
part by the number of research articles 
published in high-level journals. In 2014, 
DICP researchers published eight articles 
in the prestigious scientific journals 
Science and Nature. 

DICP is developing an impressive 
number of valuable technologies for 
industry. In the past 15 years, DICP 
has been granted over 2,000 patents, 
placing DICP ahead of all CAS insti- 
tutes in terms of number of patents 
awarded. DICP’s innovative research 
focused on enhancing China's national 
economy and security has garnered 
widespread praise, with DICP faculty 
being awarded over 85 national awards 
and more than 230 provincial-level 
awards, including the National Supreme 
Science and Technology Award, which 
was awarded to Professor Cunhao Zhang 
in 2014. 


Focusing on modern energy 
challenges 

DICP is strongly focused on address- 
ing challenges related to sustainable 
energy through cutting-edge, collab- 
orative research in all of its laboratories. 
With a tradition of excellence in basic 
and applied research and 
plinary innovation, DICP is poised to 
continue to make important advances 
in a number of research fields. It is plac- 
ing special emphasis on energy-related 


interdisci- 


research, including the development 


of cutting-edge theories and technolo- 
gies for optimal fossil energy use, high- 
efficiency chemical energy conversion 
and renewable energy to meet national 
strategic and global needs. 


Dalian National Laboratory 

The largest laboratory at DICP is the 
Dalian National Laboratory for Clean 
Energy (DNL), which is China’s first na- 
tional laboratory dedicated to energy 
research. DNL, which consists of ten di- 
visions covering research areas such as 
fossil energy conversion to hydrogen 
energy and advanced materials, is al- 
ready playing a prominent role in China's 
national economy and security. 


Clean coal technologies 

Because coal is China's dominant energy 
source, the clean use of coal is critical for 
the nation’s sustainable development. 
Researchers at DNL are contributing to 
the country's clean coal effort through 
the development and refinement of a 
key process called DICP methanol to 
olefin (DMTO). This process efficiently 
converts methanol obtained from coal 
syngas (synthesis gas) to small alkenes, 
primarily ethylene and propylene. It has 
been commercialized through strategic 
collaborations between DICP and major 
industrial partners, including SYN Energy 
Technology Company Ltd and Luoyang 
Engineering Corporation, Sinopec. 
Several plants around China are produc- 
ing around 4 megatons per year of olefin 
products. In 2014, the DMTO process 
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was awarded first prize in the presti- 
gious Chinese National Technological 
nvention Awards. DICP has also made 
important progress on a variety of other 
conversion technologies, including DICP 
methanol to propene (DMTP), syngas 
o ethanol, syngas to high alcohols and 
synthetic natural gas. As with the DMTO 
process, DICP will soon be industrializing 
hese technologies. 


Alternative energy and 

energy storage 

In recognition that non-fossil fuel en- 
ergy sources will be necessary to meet 
China's future energy demands, DNL 
researchers are also making pioneering 
advances in alternative energy tech- 
nologies, including solar and biomass 
energy, fuel cells and energy storage. 
Scalable energy storage solutions for 
commercialization are a research prior- 
ity at DICP. Major innovations (with over 
100 patents filed to date) in flow battery 
materials, structure design of stacks and 
systems, and integration technologies 
have already been made. In 2013, DICP 
and an industrial partner demonstrated 
the successful operation of the world’s 
largest 5 MW/10 MWh all-vanadium 
flow battery storage system for a 50 MW 
wind farm in China. 


State Key Laboratory of Catalysis 

Advances in catalysis are unquestionably 
one of the most critical components of 
energy research. Using state-of-the-art 
experimental and _ theoretical tech- 
niques, the DICP State Key Laboratory 
of Catalysis is investigating catalytic 


reactions, creating advanced catalytic 
materials and developing jn situ char- 
acterization and monitoring techniques 
that are impacting energy research, as 
well as environmental and synthetic 
chemistry. Detailed probing of cata- 


lytically active phases, active sites and 


reaction mechanisms has led to sev- 
eral important insights and discoveries in 
nanocatalysis, including the synthesis of 
single-atom catalysts and the discovery 
of the interface confinement effect. 


State Key Laboratory of Molecular 
Reaction Dynamics 

Major breakthroughs 
search and many other research areas, 


in energy re- 


such as environmental and biologica 
chemistry, can be achieved through 
detailed understanding of the mos 
fundamental chemical processes. The 
State Key Laboratory of Molecular 
Reaction Dynamics (SKLMRD) is one of 
a handful of facilities in the world pos- 
sessing both advanced experimenta 
sophisticated com- 


equipment and 
putational capabilities. Using crossed 
molecular beam techniques, researchers 
at the SKLMRD combine experimental 


results with complimentary accurate 
quantum dynamics calculations to 
gain deep knowledge about important 
chemical reactions in the gas phase 
and at the gas-surface interface at the 
atomic, molecular and quantum-state- 


resolved levels. Knowledge of such 


mechanistic details can lead to new 
ways to approach technology develop- 
ment, such as designing catalysts for 
hydrogen production. 
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CAS Key Laboratory of Separation 
Science for Analytical Chemistry 
DICP’s expertise extends beyond energy 
to a wide range of research fields that ad- 
dress current global research challenges. 
The CAS Key Laboratory of Separation 
Science for Analytical Chemistry is a 
leader in the development of new meth- 
ods for sample preparation, separation 
and identification of species in complex 
samples, with a particular strength in the 
development of chromatography-based 
techniques and theory. The laboratory 
has also achieved recognition for devel- 
oping and refining techniques and in- 


struments for monitoring environmental 
pollutants; these include advanced air 
sampling technologies and detection of 
trace pollutants. 


Dalian Institute of Chemical Physics 
Chinese Academy of Sciences 

457 Zhongshan Road 

Dalian, China 116023 

Tel: +86-41 1-84379175 

E-mail: info@dicp.ac.cn 


www.english.dicp.cas.cn 
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SHANGHAI CENTER FOR SUPERCONDUCTIVITY 
CHINESE ACADEMY OF SCIENCES 


he Shanghai Center for 

SuperConductivity (SC’) was estab- 

lished in 2012 as a dedicated center 
of the Chinese Academy of Sciences (CAS), 
associated with the Shanghai Institute of 
Microsystem and Information Technology 
(SIMIT). It hosts approximately 75 staff 
members, including 29 full and associate 
professors. The centre performs excellent 
multidisciplinary research on supercon- 
ducting electronics and seeks to bridge 


the gap between fundamental research 
and real-world applications. Its research 
focus includes superconducting devices 
and circuits and system integration as 
well as applications of superconducting 
devices, novel quantum materials, elec- 
tronic structure characterization and first- 
principles calculations. 

SC operates a dedicated supercon- 
ducting device nanofabrication facility 
equipped with advanced electron-beam 
lithography and an i-line stepper. It is also 


constructing a comprehensive research 
platform for the analysis of electron struc- 
ture that will employ two state-of-the-art 
synchrotron beamlines. 


Major research directions 
Superconducting quantum devices: 
Conducting research into device phys- 

ics and developing superconducting 
quantum interference devices (SQUIDs), 
superconducting nanowire single-photon 
detectors (SNSPDs), novel quantum 
devices and circuits. 
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Frontier applications of superconduct- 
ing devices: Exploring applications of 
SNSPDs and SQUIDs, such as quantum 
information, communication and imaging 
based on advanced photon-counting tech- 
nology, biomagnetic imaging, ultralow- 
field magnetic resonance imaging and 
geophysical prospecting. 

Quantum materials: Investigating novel 
quantum materials including graphene, 
topological materials and superconducting 
hybrid structures. 

Electronic structure characterization 
of novel materials: Conducting research 
involving in situ soft X-ray surface char- 
acterization, analysis of the electronic 
structures of epitaxial materials, and first- 
principles calculations. 


Talent recruitment 

Position 1: Superconducting devices 

and circuits 

Job description: Develop superconduct- 
ing devices and circuits including single 
flux quantum (SFQ) devices, transition edge 
sensors (TESs) and qubits. 
Position 2: Device physics 
Job description: Carry out independen 
research on the physics of superconduc 
ing devices and circuits. 
Position 3: Quantum materia 
Job description: Conduct independent 
research on quantum materials with an 
emphasis on superconductors and super- 
conducting hybrid structures. 


> 


Ss 


Types of positions: Full professor, associate 


professor, assistant professor and postdoc- 
toral researcher 
Offer: Postdoctoral positions are offered 
salaries of up to RMB 150-200 thousand 
per year, depending on qualifications. The 
salary is negotiable for other positions. If 
suitably qualified, candidates can access 
further benefits through state, municipal 
and CAS talent recruitment programmes. 


Requirements: 
Doctoral degree in superconducting elec- 
tronics, quantum electronics, condensed- 
matter physics or equivalent. Independent 
work experience in relevant research fields 
for two to five years or longer, depending 
on position; 
+ Papers published in high-impact journals 
in relevant fields; 
+ Two letters of recommendation; 
+ Research proposal (template 
available on request); 
+ Good team spirit; 
+ Experience of doing research 
overseas preferable. 
Please send your resume and other applica- 
tion documents to: 
Prof. Dr. Lixing YOU (lxyou@mail.sim.ac.cn) 


ShanghaiCenterfor SuperConductivity 
Chinese Academy of Sciences 
Website: sc2.sim.cas.cn 
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Q&A Chen Xiangmei and Zeng Xuan 
Times are changing 


Two women, 18 years apart, describe their experiences within China’ research system. Chen Xiangmei (pictured, left) received her MD from 
Kitasato University, Tokyo, in 1986 and is now the director of the State Key Laboratory of Kidney Diseases of China at the PLA General Hospital in 
Beijing. Zeng Xuan (pictured, right) received her PhD in semiconductor physics from Fudan University, Shanghai, in 1997 and is now a professor at 
Fudan’s School of Microelectronics. 


In China’s high schools, there are usually 

more girls studying languages and humanities 
than science and maths, and there is a stigma 
attached to the term ‘female PhD’ such that 

it has become a derogatory way of referring 

to an educated woman who is focusing on her 
career. A lot of parents tell their daughters that 
girls are not as good at science as boys. Have 
you ever faced this type of pressure? 

Chen Treating women as if they are 
intellectually inferior to men has thousands 
of years of history in China’s major ethnic 
groups, such as the Han. There are many 
families that hold onto this traditional line of 
thinking, especially in the less developed parts 
of the country. Often, such families would 
choose to support their sons to pursue a career 
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that requires more years of education. The 
situation is usually more balanced in large, 
economically dynamic cities and in educated 
families. The percentage of female students at 
top universities is increasing, and a substantial 
percentage of women are majoring in science 
or maths. 

In 2001, I became a chief scientist funded 
by the 973 Programme, also known as the 
National Basic Research Programme, which 
is a prestigious funding scheme in China. 
I was the only female chief scientist of a 
973 Programme in 2001. We need more 
women at the top level of society, not just in 
science, but also in the army, in the govern- 
ment and every aspect of society, otherwise 
we are wasting talent. 
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Zeng I was once that rare species: a female 
science PhD candidate! My interest in 
physics started when I was in high school 
in the 1980s. I was lucky that my parents 
never discouraged me from pursuing sci- 
ence. I read about the female physicist Xie 
Xide, who was president of Fudan University 
in Shanghai at the time. Scientists were like 
rock stars in the 1980s in China, and we were 
all familiar with the saying: ‘Master math and 
science, then you have nothing to fear wher- 
ever you go. 

When I arrived as an undergraduate 
at Fudan University, my first choice was 
biomedical electronics but I was transferred to 
microelectronics. I was worried initially, but 
then I learnt that the microelectronics major 


LEFT: PLA GENERAL HOSPITAL; RIGHT: CHINA YOUNG WOMEN SCIENTISTS’ AWARD 


had been created by Xie, which inspired me 
to work really hard. I was fascinated from the 
first moment that I had access to computers 
and integrated circuits in the lab during my 
second year. I was fortunate to have many 
great mentors who nurtured my interest. No 
one discouraged me from studying science 
because Iam a woman. 


Are there the same opportunities for men and 
women to become scientists in China? 

Chen After the People’s Republic of China 
was founded in 1949, women in China were 
encouraged to work and were treated as equals 
to men legally, so I think there are the same 
opportunities for men and women to become 
scientists in China. But in real life, at least until 
now, women tend to spend more time raising 
children and taking care of their families. 
Zeng Among the top high-school students, 
undergraduate students and graduate 
students in life sciences and medicine, women 
make up a similar percentage to men. That’s 
less true in maths and physics, where men 
are still in the majority. So in many fields, the 
opportunity to become a scientist is equal 
for the sexes in China. At the very top, how- 
ever, there are far fewer women. I think that 
is going to change as society becomes more 
aware of gender inequality (see “The global 
picture of women in further education). 


What is the availability of career breaks, 
maternity packages and flexible working for 
scientists in China? 

Chen Nationwide, the minimum paid 
maternity leave is 14 weeks, but women who 
have babies later in life could have an additional 
15 or 30 days depending on the region in 
which they work. Women scientists have the 
same maternity packages as women in other 
professions, but it can bea real challenge for 
them to catch up with everyone else when they 
return to work. 

Zeng With regard to maternity, instead of 
providing a longer period of maternity leave, 
its more important for a female scientist to have 
access to flexible working. An extended break 
can be detrimental to a scientist’s career, but 
flexible working arrangements are still rare in 
China. Grandparents often step in to take care 
of the baby when a mother returns to work. 

I have been a visiting scholar to several 
US universities and I was told that when 
female researchers are being evaluated for 
tenure, they are given two extra years in their 
evaluation period to compensate for time 
lost while raising children. Similar policies 
are also appearing in the national funding 
programmes in China, but there is not any- 
thing at the institutional level yet. 


What about paternity leave — how common is 
it for men in China? 

Chen It would be great if men in China would 
take a larger portion of the parental leave! I’ve 
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THE GLOBAL PICTURE OF WOMEN IN FURTHER EDUCATION 


In the past two decades, China has boosted its share of women studying in tertiary education and, after 
overtaking its neighbour Japan in 2004, it is now in line with world averages. 
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Within Europe, Norway has the 
most women in higher education. 


@ The proportion of Chinese women in 
further education increased by nearly 
one-third in just under a decade. 


1994 1996 1998 2000 


heard of men taking paternity leave to take care 
of their newborns, but I've never met anybody 
who has done so in real life. Traditional values 
still hold sway in China, so even ifa man was 
willing to take time off to care for his baby, he 

would probably face 


“ : social pressure from 
F vgtaens Ce others. I hope that 
fobeconiea parents share the 

sae responsibility equally 
alee sey to take care of their 


children, but in reality, 
it is women who end 
up doing more. 

Zeng Men can take a few days of paid paternity 
leave immediately after the baby is born, but 
I, too, have never met any Chinese men who 
took substantial paternity leave. It seems that 
Chinese men are holding onto traditional 
values very tightly. 


sexes in China.” 


Are there biases in research evaluation that 
disadvantage women? 

Chen In most of China's research-evaluation 
systems, women are treated exactly the same 
as men, but women who take time off to have 
children are often disadvantaged compared 
with men of the same age. So the National 
Natural Science Foundation of China youth 
programme changed its application age limit 
a few years ago to allow female scientists under 
40 and male scientists under 35 to apply for 
its funding. This is a great way to mitigate the 
disadvantage faced by women. 

Zeng I don't think there are any policy biases in 
research evaluation that actively disadvantage 
women, but family responsibility can some- 
times slow a female scientist down. When I 
finished my PhD, there was a gap in professors 
of both sexes because the Cultural Revolution 
put higher education on hold for a decade and 
many new graduates chose to study abroad. 
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Fudan University’s administration trained the 
young researchers to become lecturers and 
helped them to establish their own laborato- 
ries. I was lucky to take advantage of that. 


What do you think would be the best way to 
encourage more women to pursue science? 
Chen I think the best change would be to 
move away from traditional values. Young 
people with potential and who have good 
career prospects should receive support from 
their significant others and family regardless 
of their gender. Also, I tell all my students that 
if they want to become successful physicians, 
they should spend more time and energy on 
their work. It is also vital that they commu- 
nicate effectively with their partners about 
what their jobs will require so they can share 
household duties. With support from their 
families, women can become outstanding 
medical scientists. 

Zeng I am concerned that so few youngsters 
in China today aspire to be scientists. That is 
partly because teachers do so little to stimulate 
interest in maths and science, especially 
anything that directly appeals to girls. Students 
are more focused on simply getting good 
grades. The lack of encouragement at an early 
age may explain why there are very few women 
in maths and physical sciences in higher 
education. When I was studying geometry 
in high school, students who could draw neat 
lines to solve geometry problems were deemed 
smart and those who could not were deemed 
not smart. Years later, I met a mathematician in 
college and learned that the ability to draw such 
lines is not a good measure of ability: analytical 
geometry is better. We need teaching methods 
that will nurture girls’ interest in maths and 
physical sciences. m 
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tale of two systems 


Tensions between the old communist regime and modern market forces are hindering 
efforts to turn China’s scientific discoveries into commercial advances. 


BY PENG TIAN 


Jian. At the end of 2013, the engineer and 
former vice-president of the prestigious 
Zhejiang University in Hangzhou was arrested 
on suspicion of embezzling state-owned assets, 
and was formally charged in February 2015, 
just before Chinese new year. Chu was co- 
founder and director of Supcon Group, an 
engineering company based in Hangzhou in 
Zhejiang province. A major part of Supcon 
Group was Haina Software, a former subsidi- 
ary of Zhejiang Haina Science and Technology 
Company, a spin-off company from the state- 
owned university. Haina Software was sold 
to Supcon Group by 2003, and one of Chu’s 
alleged crimes is transferring the better 
performing assets from Zhejiang Haina to 
Supcon, leaving poor-performing ones behind. 
Chu’s story is not unusual. Dozens of uni- 
versity officials and entrepreneurs are being 
swept up in China’s ongoing anti-corruption 
efforts. But getting a clear picture of the extent 
of the corruption is hampered by difficulties in 


I: is definitely ‘the worst of times’ for Chu 
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penetrating the murky rules surrounding the 
commercialization of state-funded research. 
Even as China is pushing to translate more 
basic research into societal benefits, this lack 
of transparency is preventing many would-be 
entrepreneurs from taking the plunge. 


WESTERN INFLUENCE 

Chinese universities and research institutes are 
reforming their research processes and evalu- 
ation systems to meet the higher standards of 
the international academic community and 
become more like Western research systems — 
particularly that of the United States. And toa 
certain extent these reforms are working: with 
better training and more resources, an increas- 
ing number of Chinese scientists are getting 
published in higher quality publications 
such as those covered by the Nature Index 
(www.natureindex.com). 

The reform process is being fuelled by 
investment from local and central govern- 
ments. China spent more than 1 trillion yuan 
(US$165 billion) on research and development 
(R&D) across all sectors in 2013 — second 


© 2015 Macmillan Publishers Limited. All rights reserved 


only to the United States (see page $8) — and 
has the second largest output for academic 
publications and patents (see ‘Rising output). 
The hope is that basic research will feed into 
inventions and improvements for industry, 
and eventually boost the economy. 

But when it comes to turning scientific dis- 
coveries into technological innovation, the 
results are disappointing. In the biomedical 
industry, for instance, most domestic drug 
companies produce either generic versions 
of existing drugs or drugs based on Chinese 
traditional medicines, which struggle to get 
into the regulated markets of developed coun- 
tries. Even Xinhua, the official press agency of 
China, has criticized the technology transfer 
capabilities of the country’s research system. 

As the reforms of publicly owned research 
institutions continue, they throw up more 
uncertainty. “The atmosphere is changing 
in both universities and Chinese Academy 
of Sciences institutes — people can feel that 
things are coming,” says Wang Liming, a 
32-year-old neurobiologist at Zhejiang 
University. Wang, like many researchers of his 
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RISING OUTPUT 
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China’s science and technology output continues to rise steadily, in terms of both numbers of publications in the science citation index (SCI) and of patent 
applications and grants. Yet there is no clear path for translating this technology to improve industry. 
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generation, has entrepreneurial aims. But he 
acknowledges that in these unsettled times, 
many researchers will cling even more tightly 
to the old system. “Getting a publication in 
journals with a high impact factor becomes the 
priority for scientists when they feel unsafe,” he 
says, “because they know it will probably help 
them survive any kind of top-down evaluation 
led by the administration.” 


OBSTACLE COURSE 
In response to criticisms of China’s inability to 
commercialize its research, the government 
green-lit a 1-billion-yuan National Centre for 
Translational Medicine in October 2014. This 
centre is to be in Ruijin Hospital, which is affil- 
iated with Shanghai Jiaotong University, and 
is expected to complete construction in 2017. 
The Shanghai centre is the first in a series 
of projects for translational medicine from 
the government’s National Science and 
Technology Infrastructure programme. There 
are four sister institutions planned, to be built 
in influential hospitals or medical colleges: two 
in Beijing, and one each in Xian in Shaanxi 
province and Chengdu in Sichuan province. 
The main goal of the translational medicine 
centres is to more efficiently transform basic 
biomedical research into clinical applications, 
including new diagnostics, treatments and 
drugs. But many obstacles stand in the way. 
First is that although reform of China's scien- 
tific research system is under way, reform of 
its public hospital system is lagging behind. 
Around 90% of China’s health care is provided 
in public hospitals. In theory, most public 
hospitals are wholly owned and financed by 
local and central governments, which strictly 
control staff numbers and salaries. But many 
hospitals have said that they do not receive 
enough funding and need to find additional 
resources. There have been reports of hospital 
staff allegedly accepting bribes from drug or 
device firms; one high-profile case involving 
British pharmaceutical giant GlaxoSmith- 
Kline, based in London, was exposed in 2013. 
Recruitment is another issue. “Public 
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2009 2010 2011 2012 2013 


hospitals are still in the old system, where 
there is less mobility of personnel,’ says Chen 
Zhiguo, a young neuroscientist at Xuanwu 
Hospital in Beijing. Hospitals such as the new 
Shanghai translational centre, which are affili- 
ated with universities or medical colleges, may 
be able to get around this barrier by recruiting 
staff through their affiliates. But even these 
institutions often retain ties to the old system 
— and its policy and culture. Researchers at 
Xuanwu Hospital, including Chen Zhiguo, are 
obliged to publish a certain number of papers 
each year in journals covered by the Science 
Citation Index. This require- 
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his research to the market and 

gets economic benefits, the administration will 
loosen the requirement of publishing papers,” 
Chen Zhiguo says. “But before that, you have 
to cope with it” 

Then there are questions over financing. 
Because there is no Chinese equivalent of the 
United States National Institutes of Health 
(NIH), which funds clinical trials, researchers 
wishing to develop drugs must look 
elsewhere. “More than one-half of China’s 
clinical trials are supported by drug com- 
panies,’ says Li Mengfeng, vice-president 
and dean of Zhongshan School of Medicine 
at Sun Yat-sen University in Guangdong. 
Some local governments and universities are 
trying to encourage clinical research that is 
initiated instead by independent investigators. 
One example is the 5010 Clinical Research 
Foundation, established in 2008 at Sun 
Yat-sen University. But there is nothing yet on 
the national level, says Li. 

One attempt to bridge the funding gap is 
the Shanghai Guangci Translational Medi- 
cal Research Development Foundation, 
which was established in September 2014 by 
Chen Saijuan, director of the translational 
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centre, and her husband Chen Zhu, China’s 
former health minister. The main goal of the 
foundation is to finance clinical research. The 
two have successfully worked together before, 
when they led a translational medical research 
project in China that resulted in a new treat- 
ment for leukaemia. 

Even if all the obstacles could be overcome, 
these new translational centres are only one 
link in the “bench to bedside’ chain. “Every 
link of the chain has problems in policy and 
culture,’ says Li, who adds that China is still 
weaker than other developed countries in 
basic research, and that the administrative 
side of the scientific community hampers 
creativity. “Basic research is not driven by the 
realistic needs of clinical practice, it’s about 
getting more impact factors.” And, even if a 
scientist has a strong talent for innovation, the 
environment is not supportive, he contends. 
“Governments and institutes in China do not 
give scientists enough incentives to bring their 
research to application.” 

Successful translational research depends on 
many factors, says Xiao Ruiping, a physiologist 
and director of the Institute of Molecular 
Medicine at Peking University in Beijing. As 
well as providing incentives for researchers, 
other important elements to ensuring succes 
are improving laws and regulations concern- 
ing research commercialization, and ensuring 
there is availability of capital and appropri- 
ate industry infrastructures. “The problems 
inherent in China’s translational medicine 
process cannot be resolved by a few scientists 
and projects,” says Xiao. “How can these 
systematic defects be overcome?” 


STATE OWNERSHIP CONUNDRUM 

Other countries have been through the process 
of promoting commercialization of the fruits 
of publicly funded research. The key piece of 
legislation in the United States is the Bayh- 
Dole Act of 1980. Prior to this law, any inven- 
tions derived from federally funded research 
belonged to the government. Bayh—Dole gave 
those universities, institutes and companies 
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who received federal grants the right to profit 
commercially from their inventions. 

China has introduced similar polices, and 
has solidified them into formal rules in the 
Science and Technology Progress Law of 
2007, but has had less success than the United 
States at fostering the commercialization of 
university invention. The reason lies in the 
different ways the two countries treat their 
universities and research institutes — in the 
United States these organizations have more 
independence from the government than their 
Chinese equivalents, and also have autonomy 
as market participants. China’s universities 
and institutes, are all controlled by the gov- 
ernment. Any economic activities must be 
approved by the administration of state-owned 
assets, which can hinder attempts by would-be 
entrepreneurs to commercialize their research. 

Such a situation is facing Chen Zhiguo, who 
was hoping to transform his neural-stem-cell 
research into a clinical treatment when he 
returned to China in 2012 having completed 
a postdoc at Stanford University in California. 
Two years on, the results from Chen Zhiguo’s 
animal experiments are promising, but 
because his research is funded by the govern- 
ment the project will need additional capital 
to move into clinical trials. Securing capital 
means transferring intellectual property, 
which is controlled by the hospital, which in 
turn is controlled by the state. “There are many 
difficulties” Chen Zhiguo laments. “How can 
we balance the interests of the hospital, the 
start-up and the researchers?” 

The biggest problems are that current 
regulations concerning technology transfer 
are unclear and authority oversight is unpre- 
dictable. A young university researcher, who 
wishes to remain anonymous for fear of reper- 
cussion from his university administration, 
explains: “If I want to transfer my research 
achievements to the pharmaceutical industry, 
my only choice is to cross a ‘grey zone’ when 
it comes to intellectual property regulations. 
There is no clear legal track, so infractions 
are inevitable.” It is impossible, he continues, 
for a scientist-entrepreneur to avoid these 
traps forever, as Supcon’s Chu found out. 
“Faults could be investigated by the govern- 
ment or prosecutors in the future, particularly 
if a company becomes big and successful.” 
Consequently, many inventors simply avoid 
the maze of policies and regulations. If they are 
unwilling to transfer their innovations through 
‘grey zone’ agreements, their only choice is to 
abandon their inventions altogether. 

There is plenty of entrepreneurial spirit 
in China. “Chinese scientists want to be 
rich and famous,” says Xiao, who wants a 
“clear and stable system” that enables start- 
ups to flourish. The research outputs of 
Xiao’s laboratory belong to Peking Uni- 
versity, whose Office of Science and Tech- 
nology Development is responsible for 
transferring her research to industry. “The 
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Chu Jian in 2006, who was then the vice-president 
of Zhejiang University in Hangzhou. 


office is a part of the administration depart- 
ment of Peking University and does not have 
its own intellectual property lawyer,” she 
says. Xiao worked for many years at NIH, 
and contrasts the situation in China with 
that of the United States. “I had nothing to 
worry about in terms of commercializing 
my research there. The NIH has a strong 
patent office that takes care of intellectual 
property matters. Scientists in China have 
to deal with things by themselves, and the 
patents are often fragmented.” 

The research community in China wants the 
government to acknowledge the contribution 
made by scientists working on publicly funded 
projects and to give them stronger incentives 
to carry out research with potential commer- 
cial impact. For research scientists, ownership 
priority over their inventions is an important 
step in that direction. Speaking at the Research 
Assessment and Evaluation symposium in 
Shanghai in October 2014 (see page S10), 
Shen Wenqing, former deputy director of the 
National Natural Science Foundation of China, 
said: “The most important reform is reducing 
the administrative intervention from the gov- 
ernment. At the local level, some provinces are 
trying small-scale reforms: for instance, since 
2014, a research team in Hubei province can 
now own up to a 99% share of the benefits of 
their research. But this applies only to projects 
funded by the provincial government of Hubei 
— those funded by the central government 
allow for no such ownership. 


PARADOX OF TRANSITION 

The shadow of the old system still hangs 
over every corner of China’s scientific 
enterprise. The balance of administrative 
power of governments and of market forces is 
a frustrating paradox. 
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Entrepreneurs may find themselves navigat- 
ing a complicated sea of regulations issued by 
different government departments, in many 
cases trying to get one department to intervene 
against the other. “The reality is that when you 
need them, they disappear and you cannot find 
them,” the young researcher says, “and they 
always appear where they should not be.” For 
example, the China Food and Drug Admin- 
istration (CFDA) has failed to remove tradi- 
tional Chinese medicine injections from the 
market, even though their safety and efficacy 
are unproven and people spend billions of 
yuan on them each year. Yet the CFDA requires 
start-ups to have their own production fac- 
tory and certificate of Good Manufacturing 
Practice, which makes it almost impossible for 
an innovative pharmaceutical company to get 
up and running. Government officials are try- 
ing to intervene too much in market processes, 
explains Guo Chongqing, a research engineer 
at Tongji University in Shanghai. “Officials in 
governments do not have crystal balls. How 
could they plan the way that industries will 
develop?” 

The downstream businesses that are 
essential in supporting the translational medi- 

cine ecosystem are also 
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the technology, says 
Tang Yanmin, director 
of the Beijing office of Morningside Ventures, 
a venture capitalist headquartered in Hong 
Kong and one of the few firms that invest 
in early stage biotechnology in China. Tang 
adds that Chinese venture capitalists gener- 
ally prefer the safer returns from investing in 
generic drugs and medical devices, based on 
cost advantage and state protection. 

Tang believes that the returns for innovative 
drugs will rise in the future as domestic phar- 
maceutical companies complete more original 
R&D and respond to the government's empha- 
sis on advancing the industry. The fruits of 
China’s burgeoning research are ripe for har- 
vest. But at the moment, with so little clarity 
on how to proceed, concerns about corruption 
charges — such as those facing Chu — are too 
serious to ignore. “Hoping that the govern- 
ments will demolish all the invisible walls 
and draw clear lines overnight is unrealistic, 
says Zhejiang’sWang. He understands why so 
many scientists who tried to commercialize 
their research gave up, but he is determined to 
succeed. “Yes, there are still many obstacles 
from the old system,” says Wang. “But I want 
to get things done.” m 


Peng Tian is a freelance journalist based in 
Beijing. 
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Zhejiang University 
SEEKING THE TRUTH AND PIONEERING NEW TRAILS 


Zhejiang University (ZJU), one of the leading research-intensive universities in China, is 


ocated in the beautiful city of H 


angzhou. Research at ZJU spans 12 academic disciplines: 
philosophy, economics, law, education, literature, history, art, science, engineering, agriculture, 
medicine and management. Of the 22 subjects listed in Thomson Reuters Essential Science 
ndicators, ZJU is rated in the top 1% in the world for 16. 


As a national university, ZJU is building itself into an innovative and comprehensive research 
institution of world-class standing. ZJU seeks the truth and pioneers new trails. It is dedicated 


to creating and disseminating knowledge, advancing civilization, serving and leading society, 
and promoting national prosperity, social development and human progress. 


ZJU sincerely invites talented researchers 
to join it so that together we can create a 
glorious future. 


Contact: 

Talents Office, Human Resources Department 
Zijingang Campus, Zhejiang University 

No. 866, Yuhangtang Road, Hangzhou 
Zhejiang, 310058, P. R. China 


Tel: +86-571-88981390, 88981345, 88981656, 
88981453 (Postdoc) 

Fax: +86-571-88981976 

E-mail: tr@zju.edu.cn 


http://talent.zju.edu.cn 


Why did you choose Zhejiang University? 


“Zhejiang University acts 

as a bridge to the exciting 
and fast-growing Chinese 
market. It allows researchers 


to harvest real-life problems 
[ a | and provides abundant 
opportunities for applying research outcomes to 
real systems. Besides providing opportunities to 
work with the best students in China, Zhejiang also 
has an excellent research environment that nurtures 
world-class research.’ 

Professor Wenyuan Xu, College of 

Electrical Engineering 


“For a young researcher, 
China is a fascinating country 
to begin an independent 
career. The creativity and 
pace of modernization will 


impress everyone who visits, 
and having a position here gives you the opportu- 
nity to focus on what really matters: research and 
the education of a new generation of scientists. 
Zhejiang University is one of the top universities in 
China and provides starting faculty with excellent 
working conditions. 

The Department of Chemistry supports its young 
professors wherever it can, gives them much free- 
dom and welcomes interdisciplinary collaboration. | 
am happy to have brilliant and helpful colleagues as 
well as smart, hard-working students.” 

Research Professor Simon Duttwyler, 
Department of Chemistry 


“As a former PhD student 
at Zhejiang University, | 
have always valued its 
strong support net- 

work, so after doing two 
postdoc programmes 
here, when the opportunity came for me to 
stay and teach, | took it without hesitation. | 
am proud to be on the faculty of the College of 
Agriculture and Biotechnology and | feel like | 
can never repay my alma mater for transforming 
me from a young boy into the quality teacher 
that | am today.” 

Lecturer Imran Haider Shamsi (Ying Lan), 
Department of Agronomy 


“| decided to return to 
China in 2009 after 
working abroad for 15 
years as | believe that 
China is the best place to 
find sustainable devel- 
opment solutions to address our planet’s 
limited resources. Indeed, the Yangtze River 
Delta, where Zhejiang University is located, is an 
area that has prospered continuously for 
thousands of years without major environmental 
decay. In particular, | enjoy the freedom to 
pursue my area of interest as a researcher 

and the open-minded atmosphere at 

Zhejiang University.” 

Professor Xiaogang Peng, Department 

of Chemistry 


“Although | officially joined 
Zhejiang University in 2011, 

| have been collaborating 
with researchers at Zhejiang 
University since 2002. | 
have found it to be a place 
with great academic freedom, and | believe 

that’s the reason why Zhejiang University is one 
the best universities in China. In addition, | am 
grateful to be able to work in the beautiful city of 
Hangzhou, where | grew up. Zhejiang University 

is also strong in engineering, which provides a 
fantastic platform for physical science research 
and interdisciplinary collaboration — activities 
that are of particular importance to me.” 
Professor Fuchun Zhang, Department of Physics 


“Zhejiang University’s lo- 
cation in the economically 
dynamic region of the 
Yangtze River Delta provid- 
ed much opportunity for 


translational research and 
a large patient base for biomedical research.” 
Professor Lanjuan Li, Director of the State Key 
Laboratory for Infectious Diseases Diagnosis 
and Treatment 
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INSIDE VIEW 


‘insideview 


profile feature 


Lanjuan Li, Director and Chief Scientist of the CCID 


Lanjuan Li is the director and chief scientist of the CCID as well as an academician at the Chinese 


Academy of Engineering, a professor of Zhejiang University and the director of the State Key 


Laboratory for Diagnosis and Treatment of Infectious Disease. 


CCID was founded in May 2012 as a non-profit research centre with the goal of promoting both 


basic and clinical research into infectious diseases. It was established through the collaborative 


efforts of five institutions in mainland China and Hong Kong: the Collaborative Innovation 


Program of the Ministry of Education of China, the State Key Laboratory for Diagnosis and 


Treatment of Infectious Diseases of Zhejiang University, the State Key Laboratory for Infectious 


Disease Prevention and Control of the Chinese Center for Disease Control and Prevention, the 


State Key Laboratory for Emerging Infectious Diseases of Hong Kong University, and Beijing 


National Engineering Research Center for Biochip Technology of Tsinghua University. 


Q: Why is a multidisciplinary 

approach important? 

CCID has five main research focuses: basic 
and clinical research of emerging infectious 
diseases; the development of an alert system 
and an emergency response system for major 
infectious disease; research into mechanisms 
of how acute viral hepatitis turns chronic and 
severe; the treatment of severe liver failure 
using the combination of artificial liver and 
liver transplantation; and investigation of the 
relationship between infectious disease and 
human gut microecology. 


Q: Why is a multidisciplinary approach 
important? 

Since infectious diseases have complex 
pathogens and their transmission often 
involves multiple regions and species, their 
diagnosis and treatment often require 
multidisciplinary expertise. By establishing 

a collaborative innovation centre among 
several research institutions, we can better 
study the cycle behind infectious disease 
development and the interaction between 
the body’s immune system and pathogens. 
We need the ability to rapidly source the 
pathogens of emerging infectious diseases and 
establish an effective early warning system 
and a systematic approach to diagnosis and 
treatment. 


Q: What work are you doing in the area 
of liver failure? 

My research team developed a hybrid artificial 
liver support system that provides plasma 
exchange, hemoperfusion, hemofiltration 

and continuous hemodiafiltration that can 


be adjusted to the conditions of an individual 
patient. This system is now widely used in 
China, and we are continuing to improve it. 


Q: H7N9 avian flu continues to receive a 
lot of attention internationally. 

How has CCID contributed to the 
research of H7N9? 

Of all the provinces in China, Zhejiang 
province experienced the largest number of 
H7N9 cases since its initial outbreak in March 
2013, and many of patients were treated at the 
First Affiliated Hospital of Zhejiang University. 
Together with our collaborators, we published 
a clinical analysis and characterization of viral 
genome in the journal Lancet in April 2013 
and clinical findings for 111 cases of H7N9 

in the New England Journal of Medicine in 
May 2013. As soon as we realized that H7N9 
was being transmitted from live poultry to 
humans, we advised the government to close 
live poultry markets. In addition, we have 
studied the genetic mutation and cytokine 
storm of H7N9 and a developed vaccine 
against the virus. 


Q: What were the findings of research 
into the changes of the human gut 
microbiome in liver cirrhosis patients? 
Previous studies had suggested an association 
between liver diseases such as cirrhosis and 
changes in the composition of the human 
gut microbiome, but definitive associations 
between gut microbes and afflictions of 

the liver were lacking. We performed a 
microbiome-wide association study of stool 
samples from 98 liver cirrhosis patients and 
83 healthy controls of Chinese ethnicity. 


ADVERTISEMENT FEATURE 


Quantitative metagenomics analysis revealed 
75,245 genes that were significantly different 
in abundance between the cirrhosis patients 
and the control group. Many of these genes 
could be grouped into 66 clusters that 
represent cognate bacterial species. Of these 
66 clusters, 28 species were enriched in liver 
cirrhosis patients and most were of buccal 
origin (mostly Veillonella and streptococci). 
Based on this analysis, we identified 15 genes 
that could form the basis of a highly accurate 
discrimination index for diagnosis. 


Q: What are CCID’s future plans? 

CCID has five main active research areas, 
these include: (1) Basic and clinical research of 
emerging infectious disease, (2) Developing 
an alert system for major infectious disease 
and an emergency response system, (3) 
Clarifying the mechanisms behind how 
acute viral hepatitis becomes chronic and 
severe, (4) Combining artificial livers and 

liver transplantation to treat severe liver 
failure and (5) Examining the relationship 
between infectious disease and human gut 
microecology. CCID is using metagenomics 
and metatranscriptomics tools to clarify the 
role of microorganisms in the progression of 
liver diseases. CCID is dedicated to finding 
fresh strategies for diagnosing, treating and 
preventing diseases such as liver cirrhosis. 
Weare looking into clarifying the effects 

of antibiotics and probiotics on the liver, 

to explain the effects and mechanism 

of probiotics like lactulose on disease 
progression. We are also studying the 
alteration of gut microbiomes of liver failure 
patients in the hope of finding new candidate 
probiotics for liver disease treatment. 

We are seeking to expand our international 
collaborations. Members of CCID will be 
attending more international conferences to 
communicate and cooperate with researchers 
in different countries. We are hoping to 

draw on a wider range of expertise and aim 
to become the Chinese equivalent of the 
National Institute of Allergy and Infectious 
Diseases in the United States and the Pasteur 
Institute in France. We aim to significantly 
reduce the incidence of infectious diseases 
and the associated mortality rate in China. 
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DIAGNOSIS AND TREATMENT OF INFECTIOUS DISEASES 


n 2011, the Chinese Governmen 
introduced an_ initiative to promote 
cooperation between leading research 
institutions in China and abroad with a 
view to boosting the innovation capacity 


and accelerating the internationalization of 


China's higher education. 

Zhejiang University was awarded sub- 
stantial funding from this initiative, which i 
used to form the Collaborative Innovation 
Center for Diagnosis and Treatment of 
nfectious Diseases (CCID) in May 2012 in 
conjunction with Tsinghua University, the 
University of Hong Kong and the China 
Center for Disease Control. Lanjuan Li, 
he director of the centre is joined by 
senior principal investigators Jing Chen, 
Kwok-Yung Yuen, Shusen Zheng and 
Jianguo Xu, associated with the collaborat- 
ing institutions listed above. In 2014, the 
Chinese Government made CCID the only 
national-level collaborative innovation cen- 
tre for medical research. 


The five missions of CCID 

CCID seeks to research, prevent, diagnose 
and treat infectious diseases in China with 
he goal of creating solutions that have 
he greatest possible impact. The centre 
is currently focusing on the early warn- 
ing, prevention, pathogenesis, molecular 
mechanisms, diagnosis and treatment of 
infectious diseases — especially influenza 
and hepatitis. It has research platforms 
for researching infectious pathogens and 
providing warning and forecasting as well 


as conducting pathogenesis, diagnosis, 


treatment and bioinformatics. CCID also 
has the capacity to clinically evaluate new 
drugs for infectious diseases and perform 
animal studies and clinical diagnosis 
and treatment. 

CCID's first mission is to establish effec- 
tive early warning and monitoring sys- 
tems for new infectious diseases, detect 
trace pathogens of emerging infectious 
diseases, determine the mechanisms 
responsible for the sudden onset and 
subsequent intensification of diseases. 
In addition, it involves developing clini- 
cal treatment strategies and treatments 
and reducing the high mortality rates of 
patients with rapid disease progression. 

The centre's second mission is to utilize 
community — disease-prevention —_pro- 
grammes and epidemiological data cloud 
platforms to monitor the occurrence of 
emerging infectious diseases. CCID is also 
investigating the patterns behind the 
spread of disease with the aim of develop- 
ing forecasting and early warning systems. 
CCID is dedicated to enhancing the capa- 
bilities for rapidly identifying and prevent- 
ing infectious diseases in China. 

CCID's third mission is to study how viral 
hepatitis and other major infectious dis- 
eases become chronic and severe and can 
eventually lead to cancer. The centre also 
aims to establish early warning and forecast 
systems for chronic hepatitis B patients. 

To tackle the problems of severe liver 
failure and the high mortality rate of liver 
disease, CCID is striving to clarify the mecha- 
nism of liver failure induced by severe liver 


disease, conduct effective and individual- 
ized artificial-liver treatment and research 
liver transplantation to significantly reduce 
mortality. This is its fourth mission. 

The fifth mission for CCID is to system- 
atically characterize the key development 
stages, structure and function of human 
gut microflora of patients suffering from 
H7N9 influenza and other emerging in- 
fectious diseases, as well as liver disease, 
tuberculosis and other major infectious 
diseases. In particular, CCID will inves- 


tigate how the relations between the 
microecology, the host's immune system 
and metabolic changes in the hope of 
achieving a breakthrough in identifying 
gut microbiome dysbiosis. 


Looking for international collabora- 
tion and talented researchers 

Under the leadership of academician 
Lanjuan Li, CCID has a formed a pool of 
talented researchers with clinical and basic 
research experience across many disci- 
plines. It has already published 22 papers 
in journals with impact factors exceeding 
30, such as Nature, Lancet and the New 
England Journal of Medicine and holds over 
100 patents. CCID responded rapidly to the 
first wave of avian influenza A H7N9 and 
discovered novel liver cancer biomarkers. 
CCID is actively looking 

to recruit talented re- 
searchers from over- 
seas and_ establish 
more _ international 
collaborations. 
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Anthony Cheetham 
China still rising 


Materials scientist Anthony Cheetham, vice-president of Britain’s Royal Society, outlines how China’s approach to recruiting and funding scientists 
is driving steep increases in Chinese research productivity. 


What are your experiences of working with 
researchers in China? 

Ihave been to China about 15 times in various 
capacities. As a science adviser to the board 
of Unilever from 2000 to 2008, I visited the 
company’s lab in Shanghai at least once a year. 
From 2004, I ran the International Centre for 
Materials Research at the University of Cali- 
fornia, Santa Barbara; it was clear that science 
was really taking offin China, so building links 
there was one of my top priorities. I went on 
to co-chair the first US-China workshop on 
advanced materials, for which we took a del- 
egation of faculty members, students and post- 
docs to Beijing. After I moved to Cambridge 
in the United Kingdom, we continued the link 
with a UK-China workshop over here. 
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In which types of institution does the best 
scientific research take place? 

Much of the leading science is done in the 
institutes of the Chinese Academy of Sci- 
ences (CAS), which subsumes two different 
roles. It has academicians, like those at the 
Royal Society or other science academies, but 
it also runs roughly 100 institutes covering all 
facets of science and engineering. These are 
distributed around the country with some- 
thing of a concentration in Beijing and 
Shanghai. It is a little bit like the German 
system in which a lot of the outstanding 
research takes place in Max Planck institutes 
rather than in universities. Although most of 
the best science happens at CAS, China's top 
universities are also very impressive. 
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In which fields is the country strongest? 
Overall, the number of Chinese publications 
in leading journals is roughly comparable 
to that of the United Kingdom, France, Ger- 
many or Japan. What was striking was that 
when I looked at the trend from 2000 to 2014, 
although there were variations from year to 
year, there was steep upward growth in all areas 
for China. 

Among the major science categories, Chi- 
nese growth was steepest in chemistry, to the 
point where the country is now second in the 
world in terms of overall output behind the 
United States. In physics, China is compara- 
ble with the United Kingdom, perhaps slightly 
overtaking us, yet in life sciences it still lags 
behind — although again the gap is narrowing. 


JON ENOCH 


My interpretation is that China has invested 
heavily in science across the board, but it is 
easier to make rapid progress in a discipline like 
chemistry because you do not need the large, 
sophisticated infrastructure that you often do 
in physics. And, on the other hand, funding for 
the life sciences is particularly strong in Britain. 


Is the funding application system fair and 
effective? 

In 2010, I went to China as part of the first 
international evaluation of the National Nat- 
ural Science Foundation of China (NSFC), 
which is their equivalent of the US National 
Science Foundation and of bodies like the UK 
Engineering and Physical Sciences Research 
Council (EPSRC). I was impressed by the 
quality of the NSFC’s processes when it came 
to funding grants. We were a small commit- 
tee of halfa dozen international scientists and 
two or three Chinese scientists, and we were 
given unfettered access to anybody we wanted 
to speak to and given a large amount of data 
about trends in applications, about their vol- 
umes and pass rates. We were able to interview 
stakeholders in the whole process, including 
not only people from the NSFC, CAS and 
China's Ministry of Science and Technology, 


but also administrators and grant holders 
from universities. We found that the NSFC 
commands a huge amount of respect among 
Chinese scientists. 

Funding that goes through CAS is done 
differently, but nevertheless Bai Chunli, its 
president, is looking at ways to improve the 
evaluation of the performance of its institutes 
and individuals within them. CAS feels it has 
to work a bit harder to justify its large tranche 
of funding from the public purse. It has been 
seeking international advice on this, and the 
Royal Society sent a delegation over there a 
couple of years ago as part of that process. 


What specifically does the NSFC do best? 

It is doing peer review very well. The pro- 
gramme officers we met were outstandingly 
good. There was a lot of transparency in the 
system so that even 


when people did not “Young 

get funding, they felt ¢oiontists in 

they had been given Ching have 

a fair opportunity. incredible levels 
They are also very : 

: é of passion and 
proactive on scien- drive that we 
tific misconduct and dan! 7 
have an established “0M £ of ten ecemt 

the West. 


system for monitor- 
ing it and prosecuting 
cases. To be honest, if we had done the same 
in-depth evaluation of a funding body like the 
EPSRC, I suspect that the council would have 
come out of it with a similar or maybe even 
lower score than the NSFC. 


Do Chinese scientists face particular pressures 
relating to how their work is disseminated? 
Salaries and funding decisions are directly 
related to numbers of publications in high- 
profile journals. This emphasis on quantitative 
metrics also applies to the evaluation of can- 
didates for academic positions. I know from 
supervising Chinese postdocs that if they want 
to go back to China, they need a minimum of 
four or five publications in a well-defined list 
of high-quality journals, and that they must 
be either the first or a corresponding author. 

It is a competitive system and the quality of 
science coming out of China is getting better 
each year. I personally would like to see them 
placing greater emphasis on evaluating peo- 
ple and reading what a researcher has done, 
rather than ticking boxes and using the impact 
factors of journals as a surrogate for quality. 
I suspect that the number of people applying 
for positions is huge. If you are deluged with 
candidates, that might explain some of the 
rather crude tools used to filter them. 


Critics suggest that this focus on metrics 
encourages a culture of seeking quick, short- 
term wins. Do they have a point? 

There are different ways of responding to 
the pressures in the Chinese system. Certain 
things can’t be done quickly. There are people 
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everywhere who are interested in short-term 
success and I don't think the Chinese are any 
different in that respect. 

I do think that the emphasis on publications 
in high-impact journals can drive some bad 
behaviour. There is concern that in a minor- 
ity of cases it can encourage falsification of 
data and plagiarism. These things happen 
everywhere but are perhaps more likely when 
rewards are tightly linked to outputs. 


Compared to other countries, China spends 
proportionally less money on basic science 
and more on developing existing technologies. 
Does this make the country less likely to make 
fundamental discoveries? 

Based on the sheer volume of work its scientists 
publish in world-leading journals, it is 
ill-informed to think that China is not very 
good at discovery. These are not journals that 
publish derivative stuff. There may be more 
money going into developmental science, 
but the amount in absolute terms going into 
discovery science is probably higher than in 
most Western countries apart from the United 
States. The processes they have in place clearly 
enable them to support an ever-growing 
amount of high-quality science. 


How good are Chinese scientists at translating 
their scientific discoveries into applications 
that benefit society? 

China has a long way to go when it comes to 
capitalizing on its scientific base. The concept 
of gaining real economic advantages from its 
investment in science is more embryonic there 
than elsewhere. In the United Kingdom we 
have the Catapult innovation centres to help 
to do that, and other countries across Europe 
have similar institutions. Whether it is train- 
ing in entrepreneurship for scientists or help- 
ing them to start companies, these things are 
not as well developed in China. Venture capi- 
tal exists but it is nothing like the level in Brit- 
ain, let alone the United States. The Chinese 
recognize that their capability in science has 
outstripped their ability to take advantage of 
it, and they need to build up their capabilities. 


What could outsiders learn from the way 
Chinese researchers manage their science? 
We could learn a lot just by observing the 
speed with which they have gone from being 
minor league players 20 years ago to being in 
the very front ranks of international science 
today. It’s a phenomenal success story. Beyond 
the substantial and sustained financial invest- 
ment, what I can say from my experiences 
of taking Western students to workshops in 
Beijing is that they are generally amazed by the 
levels of enthusiasm of their Chinese counter- 
parts. There is no doubt that young scientists 
in China have incredible levels of passion and 
drive that we don't often see in the West. 


INTERVIEW BY NIC FLEMING 
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and development and huge investments in buildings and 

equipment, it is no wonder that Chinese scientists, officials 
and the general public are expecting great advances in science and 
technology. Yet the reality is that an increasing number of young 
researchers, including graduate students and postdoctorates, are 
choosing to escape from research — a situation that has become a 
topic of heated national debate. 

Many recruitment notices for senior research positions require a 
candidate to have a good publication record, impact factor and cita- 
tion metrics. Young scientists strive to have a paper published in Cell, 
Nature or Science — elite, peer-reviewed journals known collectively in 
China as CNS. A paper ina CNS journal is an unwritten requirement 
for a better salary or a promotion — especially in China’s top univer- 
sities and institutes. A CNS paper could change a young researcher's 
academic career — or even his or her life. 

Focusing evaluation on such inflexible 
terms leads us, as scientists at the beginning 
of our careers, to adopt utilitarian rather than 
academic motives. Rather than follow our 
natural curiosity for research, we choose small, 
straightforward projects so that we can obtain 
results quickly. We need a flow of publications 
to ensure continued financial support. But every 
researcher also wants the opportunity to make 
a CNS splash. 


(ex the decades of continued growth in spending on research 


HOW TO RETAIN TALENTED SCIENTISTS 

The irony is that although investment in science 
is rising in China across the disciplines, there are 
few funds or fellowships for newly graduated 
PhDs. The largest is the Young Research Fund, 
which represents about 20% of the budget of the 
National Natural Science Foundation of China (NSFC). However, each 
project receives only 240,000 yuan (US$38,000) over three years. The 
second option is a grant from the China Postdoctoral Science Founda- 
tion, in which a successful applicant can secure 50,000-80,000 yuan for 
two years work. Neither amount is sufficient to purchase the necessary 
laboratory reagents and consumables to conduct research. For this rea- 
son, many young researchers apply for overseas postdoctoral positions, 
giving some of their best years of research to other countries — and 
often staying there. 

As a consequence — and in contrast to most Western countries 
— research projects in China rely on graduate students rather than 
postdocs. Exacerbating this trend, many Chinese universities and 
institutes hire as faculty only those candidates who have overseas 
research experience. China is, in effect, outsourcing the training of its 
young researchers. But the reality is that most scientists who graduate 
with a PhD from a Chinese university will not receive overseas offers; 
their choice is to stay at home and find work in a domestic laboratory 
and be paid a low salary with little hope of promotion, or give up their 
academic career entirely to seek alternative employment. 

Thirty years ago, Ray Wu, a molecular biologist at Cornell University 
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ALTHOUGH 


INVESTMENT 


IN SCIENCE IS 


RISING 


IN CHINA, THERE ARE 


FEW FUNDS 


FOR NEWLY 
GRADUATED PHDS. 


- Give youth a chance 


Despite the growth in science spending and research output in China, 
young researchers are struggling to prosper, says Chuan-Chao Wang. 


in Ithaca, New York, initiated the China-United States Biology and 
Biochemistry Examinations and Applications (CUSBEA) programme, 
which allowed for the first time large numbers of young graduates 
from China to pursue PhDs at leading universities in the United States. 
Many CUSBEA students have since become leading biologists in the 
United States as well as in China. Following Wu’s death in 2008, the 
Ray Wu Memorial Fund established a prize in his honour to inspire 
and reward PhD students who wish to further their education at uni- 
versities in China, Hong Kong, Taiwan or Singapore. Unlike China's 
typical evaluation system, the Ray Wu Prize — an award of US$3,000 
plus the opportunity to apply for up to $5,000 to attend international 
conferences — does not focus on a student's history of journal publica- 
tions, overseas experience or examination results. 

Instead, the prizewinners, of which there could be ten or 
more each year, are determined by peer review by a selection 
committee consisting of eminent Chinese 
biologists who assess each candidate’s 
capacity for creative ideas, independent 
thinking and dedication to his or her field 
of interest. Applicants submit a statement 
describing why they chose to study science, 
the academic projects they would like to pursue 
at university, and their intended future career. 
They also have a face-to-face interview with the 
committee. In the years, or even decades, after 
the prizes have been awarded, the committee 
remains available for advice as the researchers 
develop their scientific careers. 

The Ray Wu Prize is a comprehensive and 
impartial evaluation process that also offers 
advice and support. Moreover, it is starting 
to be recognized by Chinese universities and 
institutes as a factor in faculty appointments. In 
2014, a scientist who was awarded the Ray Wu Prize was appointed as 
assistant professor (tenure-track) at ShanghaiTech University, despite 
not having overseas research experience. 

For decades, the minds of Chinese students have been shaped by 
passive rote education. But scientific progress requires not recitation 
of known concepts, but new ideas. The capacity to break the mould 
and truly innovate — the features that the Ray Wu Prize emphasizes — 
are crucial for the Chinese scientific community to deliver the results 
that the government and the people expect from their investment. 
The potential of new graduates cannot be fully reflected by publi- 
cation metrics or overseas experience and as a consequence, China 
is currently losing their contribution. China must invest more in its 
promising young scientists, and in more effective ways of measuring 
their achievements. = 


Chuan-Chao Wang is a PhD candidate in biological anthropology 
at Fudan University in Shanghai. In June 2015, he will become a 
postdoctoral researcher at the Max Planck Institute for the Science of 
Human History in Jena, Germany. He won the Ray Wu Prize in 2012. 
E-mail: cchao.wang@gmail.com 
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Q&A Jin Dong-Yan 
The global view 


Jin Dong- Yan is a virologist in the University of Hong Kong’s department of biochemistry. With a 
bachelor’s degree from Sun Yat-sen University in Guangzhou, a PhD from the Chinese Academy 

of Preventive Medicine in Beijing and a postdoctoral stint at the US National Institutes of Health, 
Jin benefits from the experience of working in vastly different research systems. 


Why did you decide to go to the United States? 
Twenty years ago, China had well-equipped 
facilities, especially in my field at my 
institution. The infrastructure to support 
fundamental research in biomedical sciences, 
however, was not well developed. I decided 
to do my postdoctoral research in the 
United States because the country was more 
advanced and had more world-class research 
opportunities than China. 


What brought you back to Hong Kong? 

First, 1am Cantonese, so Hong Kong was a 
natural choice for me and my family. Second, 
when I joined the University of Hong Kong 
(HKU) in 1998 [a year after it officially 
reverted to Chinese rule], the city had already 
developed an advanced infrastructure to sup- 
port basic research. I believed that Hong Kong 
was the best place to do science in Asia, and 
that I would do well. Third, HKU approached 
me with an attractive offer. 


What are some of the strengths and 
weaknesses of Hong Kong’s research system? 
The greatest strength is international peer 
review. All proposals submitted to Hong 
Kong’s Research Grants Council are subjected 
to a rigorous review process handled by a panel 
comprising world-renowned, independent 
experts. Unfortunately, because of limited 
resources, researchers in Hong Kong seldom 
have the chance to propose big projects. They 
have to be realistic and their projects tend to be 
more conservative. 


What is the tenure system at HKU like? 

HKU did not have a well-developed tenure 
system before I joined, but it was in the process 
of setting one up. Before that, people would 
be tenured immediately when they joined the 
university. That is no longer the case: today, 
people are given tenure only after the satisfac- 
tory completion of two three-year contracts, 
and there is no automatic promotion. In this 
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respect, the HKU tenure system has become 
more similar to that at universities in North 
America. Beyond awarding tenure, we also 
have a system of performance review and 
development to evaluate our faculty annually. 
For non-clinical staff, we look at teaching, 
research and community service; clinical staff 
are also evaluated by their clinical service. 


How does the university decide who to recruit? 
We look at publications, but that is just one 
of several important factors. For example, 
we look at whether the candidate’s areas 
of expertise complement what we already 
have. And during an interview, we evaluate 
communication and interpersonal skills. We 
need to find out whether this candidate is the 
one we really want and will enhance us in the 
years to come. 


What is the system like in mainland China? 
The mainland system has evolved over the 
years. When I left Sun Yat-sen University 
30 years ago, performance reviews for tenure 
and promotion were run by local research- 
ers, who might be influenced by personal 
connections. After many years of criticism, 
the system became more modern and inter- 
national. One of the improvements was to 
weigh journals’ impact factors when assessing 
the value of a researcher’s publications. This 
encouraged faculty members to publish 
papers in top international journals. However, 
the system went too far and placed too much 
emphasis on impact factors and not enough 
on international peer review. The abuse 
of impact factors is out of proportion now, 
and that is the main weakness of the current 
system in mainland China. 


What brought about these changes in the 
Chinese system? 

The main driver was that many Chinese 
scientists based in the United States criticized 
the Chinese system and suggested a change 
from rule-by-people to rule-by-merit. They 
found that funding and faculty promotion are 
decided in the US ina merit-based way, which 
they felt is superior to the system in China. The 
Chinese government finally took some of their 
advice. Unfortunately, the Chinese evaluation 
system has gradually evolved into not a merit- 
based but a metrics-based one. 


Is Hong Kong’s research system still better 
than mainland China’s? 

Yes, our system is more advanced and fairer 
because we are much closer to the international 
standard. Also, on the mainland there is still 
too much interference in research by the gov- 
ernment. That is not what scientists need when 
they are trying to do good research. Mainland 
China is making progress to improve its 
system, but there is still miles to go. 


INTERVIEW BY FELIX CHEUNG 
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POLYU: WHERE INNOVATION 


MEETS APPLICATION 


he Hong Polytechnic 
University (PolyU) firmly believes 


that research is an integral part of 


Kong 


academic life on campus. We are highly 
cognizant of the role technology plays 
in a knowledge-based economy and its 
growing importance in enhancing our 
well-being, economic growth and globa 
competitiveness. The — technologica 
breakthroughs fostered through our re- 
search initiatives have been pushing the 
boundaries of sustainable developmen 
and changing our world for the better. 
Over 2,700 research projects were 
conducted in 2013-2014. These ranged 
from ‘down-to-earth’ projects that touch 
daily lives to ‘up-to-space’ 


people's 
innovations that are expected to have 
far-reaching effects on the development 
of humankind. This research 
funding of HK$1,541 million and involved 
more than 1,000 academic staff and about 


received 


1,000 research personnel. 


Advancing knowledge and pushing 
technological frontiers 

Over the past few years, PolyU's research 
innovations have continued to break new 
ground and contribute to technological 
advances that benefit the world. 


Aviation Services Research Centre 

Since the opening of its first phase in 2013, 
the Aviation Services Research Centre 
(ASRC) has secured more than HK$70 
million in funding from the Innovation 
and Technology Fund as well as the par- 


ticipation of The Boeing Company, Hong 
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ong Aircraft Engineering Company Ltd 
and Hong Kong Aero Engine Services Ltd. 
The ASRC has now embarked on a 
number of aviation services research proj- 
ects that will address the pressing needs 
of the maintenance, repair and overhaul 
industry. The first project, Laser Projected 
Drilling Templates and Robotic Drilling, 
design a process using a portable laser 
drilling template that will save manpower 
and streamline the workflow of replacing 
parts on the outer skin of an aircraft. 
The second — project, 


Ww 


Mechanica 


Refurbishment of Aviation Parts, wil 
develop advanced methods for mechani- 
cally refurbishing aircraft parts that wil 
greatly cut maintenance costs. This wil 
be achieved by developing a multi-axis 
probing technology and the world's firs 
mechanized method for re moving distor- 
tions from turbine backing ring seals. 
Another ten projects are currently being 
undertaken. 


Railway technologies gain global 
recognition 
Interdisciplinary researchers from PolyU's 
Faculty of Engineering have been contrib- 
uting to railway technology advancement 
through optical fibre and ultrasonic wave 
sensing technologies for monitoring the 
structural health and safety of the nation’s 
rapidly expanding high-speed rail. 
Recently, the team’s efforts were rec- 
ognized by Berthold Leibinger Stiftung, 
a German organization that promotes 
research. The organization 
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awards every other year to researchers 
who have made important advances 
in the use of lasers. In 2014, Michael Liu 
Shun-yee, Tam Hwa-yaw and Ho Siu-lau 
from the Photonics Research Centre and 
he Department of Electrical Engineering 
were named second runner-up for adapt- 
ing fibre Bragg grating (FBG) technology 


for use in railway monitoring systems. 


Using light to monitor 
train conditions 
The new FBG 

developed at PolyU's Specialty Optica 
Fibre Fabrication Laboratory, are specially 
constructed to both detect and transmi 


sensors, which were 


information about a wide variety of 
problems. FBGs are built inside a fibre-optic 
cable. Their design allows them to reflec 
aser light of certain frequencies back to its 
origin while letting other frequencies pass. 
The new sensors take advantage of the fac 
hat any movement in the fibre-optic cable 
changes the shape of the sensor inside it, 
which in turn alters the frequency of the 


reflected light. A computer can analyse 
hese changes in the light to determine 
he type and degree of movement in the 
cable. The team initially installed their 
new sensors on the tracks at Hong Kong's 
rain stations with the sole purpose of 
counting the number of trains entering 
and leaving. Vibrations on the rails shake 
he sensors, deforming them enough to 
send signals to the monitors. While the FBG 
sensors performed this job very well, the 
researchers soon discovered more subtle 


uses for them. 
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This technology has many advantages 
over electronic systems. It is immune to 
the electromagnetic interference and 
lightning that can affect aboveground 


electric circuits, and the signals can be 
detected over distances as long as 100 
kilometres. Furthermore, the sensors are 
passive and thus do not require power 
supplies. The system only needs a laser 
to generate light and a spectroscope to 
analyse the returning signals. Finally, the 
research team estimates that hundreds 


of sensors can be integrated into a single 
length of fibre. 

The equipment is also physically 
robust. The fibres are made of glass and 
thus are non-flammable and not 
susceptible to corrosion, ensuring low 


maintenance costs. 


Building better rail networks 
In addition to Hong Kong's rail network, 
rail Operators around the world are 
beginning to adopt the new technology 
in their systems. The Beijing—Shanghai 
High-Speed Railway was the first to tes 
the technologies in 2011. More recently, 
Transport New South Wales in Australia is 
planning to install the sensors on its firs 
driverless trains. 


As a result of cooperation with Dalian 
Jiaotong University in China, the PolyU- 
developed FBG sensors have been in- 
stalled in high-speed inspection trains for 
monitoring the condition of rails. Another 


user, Southwest Jiaotong University, 
through its Railway Development 
Company, recently expanded _ its 


cooperation with PolyU's Base for 
Commercialization, Training and Research 


in Shenzhen. Having already collaborated 


on the installation of FBG sensors in high- 
speed trains on the mainland, the two par- 
ties signed an agreement in June 2014 to 
further advance the development of the 
high-speed rail network and technology. 


Fostering new perspectives in 

niche areas 

Building on a solid foundation of research 
strengths established over the years and 
the synergies achieved through multi- 
disciplinary collaboration, the university 
has been developing novel innovations 
and raising its profile both regionally 
and internationally. 


Developing sophisticated space 
tools - camera pointing system 
PolyU has been continuously expanding its 
space engineering technologies on several 
fronts. Recent successes include a PolyU 
designed and built camera pointing system 
hat was used to capture images of the 
oon’s surface by the Chinese lunar lander 
Chang'e-3, which landed on the Moon 
in December 2013. Jointly developed by 
PolyU researchers and the China Academy 
of Space Technology, it is the first Hong 
ong-designed and = -manufactured 
instrument to operate successfully on a 
celestial body. A lightweight motorized 
device, it moves the camera vertically and 
sideways so that it can capture images of 
the Moon and monitor the movement of 
the lander’s rover. All pictures sent back by 
the lander from the surface of the moon 


were taken with the support of this device. 

Wu Bo of the Department of Land 
Surveying and Geo-Informatics also con- 
tributed to the success of the mission, as 


his lunar mapping techniques were used 


in the topographical analysis and selec- 
tion of Chang'e-3’s landing site. 

A PolyU research team has also been 
invited to develop the surface sampling 
and packing system for the third phase 
of China’s Lunar Exploration Programme, 
which is to be launched in 2017. The team 
isled by Yung Kai-leung of the Department 
of Industrial and Systems Engineering and 
with the support of Robert Tam and his 
teammates at the Industrial Centre, who 
will participate in the mission of bringing 
lunar regolith (loose rock and dust above 
a layer of bedrock) back to earth. 
Experienced in international space ex- 
ploration, the PolyU team has developed 


many flight-qualified space instruments 
over the years, including the Mars Rock 
Corer for the European Space Agency's 
Mars Express Mission in June 2003. It also 
developed the Soil Preparation System 
(SOPSYS), which was installed in the 
lander of the Sino-Russian Phobos-Grunt 
mission in November 2011. SOPSYS con- 
ducted in situ analysis of Phobos rocks. 

At the 42nd International Exhibition of 
Inventions held in Geneva in April 2014, 
SOPSYS was awarded the Grand Prize and 
Gold Medal with Jury's Commendation. 
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A DECADE OF GROUNDBREAKING 


MEDICAL RESEARCH 


he Institutes of Biomedical Sciences 
(IBS) is a research centre estab- 
lished to promote cooperation 
between basic research and clinical stud- 
ies at Shanghai Medical College of Fudan 
University. Successively led by Fuchu He, 
Lin He, and Peng-Yuan Yang, IBS consists 
of three research units and a core facility 
conducting research in metabolic and mo- 
ecular cell biology, epigenetic medicine, 
birth defects and cancer biology, pro- 


eomics and systems biology for medicine, 
structural biology and innovative drug dis- 
covery. Researchers at IBS are encouraged 
0 collaborate with other departments of 
Fudan University as well as hospitals affili- 
ated to the Medical College. 


Metabolic changes caused by cancer 
Metabolic alteration is a hallmark of can- 
cer cells. The Laboratory of Molecular 
Cell Biology, co-founded and co-led by 
Xiong Yue and Kun-Liang Guan, and 
later joined by Qun-Ying Lei, Shi-Min Zhao, 
Dan Ye and Haixin Yuan has discovered 


that protein acetylation is an evolutionarily 
conserved post-transcriptional modifica- 
tion that regulates nearly every metabolic 
enzyme involved in diverse metabolic 
pathways. The team demonstrated tha 
protein lysine acetylation plays a key role in 
regulating metabolic enzymes and thus in 
maintaining metabolite homeostasis. The 
team further revealed the link between 
dysregulation of metabolism and mul- 


tiple types of human cancers. They found 
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that eight genes encoding for four meta- 
bolic enzymes are frequently mutated 
in cancers, thereby altering epigenetic 
regulation. The researchers are currently in- 
vestigating acetylation-based regulation in 
other cellular pathways and the metabolic 
changes that contribute to tumorigenesis 


for cancer therapy. 


Histone modification and 

DNA methylation 

Histone modification and DNA meth- 
ylation are critical mechanisms for epi- 
genetically altering gene expression. The 
Epigenetics Research Center, co-founded 
and co-led by Yang Shi and Yujiang Geno 
Shi, and later joined by Yanhui Xu and 
Fei Lan, has investigated the dynamic 
regulation of DNA methylation by the 
DNMT and TET protein families, which 
write and erase DNA methylation in 


he mammalian genome, respectively. 
Significantly, they used a structural ap- 
proach to uncover the substrate regula- 
ion mechanism of TET2 and reveal the 
mode of crosstalk between histone mod- 
ification proteins and de novo DNA meth- 
yltransferases (DNMT3A). The researchers 
are also using integrated approaches to 
develop small molecule inhibitors with 
potential therapeutic applications. 


Deciphering RNA and neuronal 
differentiation 

The Alastair Murchie laboratory focuses on 
characterizing ligand-sensing RNAs and 


the mechanisms and functions of RNA sen- 


sors in bacteria and eukaryotes. The labora- 
tory has identified and characterized a new 
class of aminoglycoside-sensing RNA that 
regulates the expression of aminoglyco- 
side antibiotic-resistance genes. This was 
the first report of antibiotic resistance con- 
trolled by an antibiotic-sensing RNA. 
Wenyu Wen’s laboratory focuses on 
the molecular basis of protein complexes 
that regulate neuronal differentiation and 


signalling. The laboratory has performed 
structural and functional studies on the dy- 
namic visual scaffold INAD and found hints 


of a novel mechanism by which scaffold 
proteins actively influence signalling pro- 
cesses by modulating the conformation- 
coupled redox potential. 

IBS has made great strides in improving 
our understanding of human health and 
disease and will continue to bridge the gap 
between basic research and clinical ap- 
plications. All of the research teams at IBS 


continue to recruit new faculty members 
and postdoctoral researchers to contribute 
significantly to the health needs of China 
and the world. 
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Brasilia. “Marina does not negotiate on 
principles,” she says. 

Others go a step farther. “I think she is a 
little too radical,’ says Pedro Alves Vieira, a 
geologist at the State University of Goids and 
environment secretary for the municipality 
of Goias. Although Silva has hewed to the 
centre during the campaign, Vieira fears that 
she may be too focused on the environment, 
too religious and too rigid for the presidency. 

But some of Silva’s 

Ithink it major stances have 

‘or evolved. As environment 

minister, she opposed 

the introduction of 

genetically modified soya beans, but has 

since sought to mend fences with the pow- 

erful agribusiness industry — stressing that 

modern agriculture, practised legally, is not 
at odds with forest protection. 

Her plan for sustainable development 
focuses on stemming Brazil's gradual shift 
toward fossil fuels: renewable energy fell 
from 95% of the nation’s energy mix in the 
1990s to 78% in 2013, according to her cam- 
paign. To reverse that trend, she is calling for 
the installation of solar panels on 1 million 
homes and the revitalization of the sugar- 
cane ethanol sector, which has struggled to 
compete with heavily subsidized petrol. She 
is also pushing for sustainable forest conces- 
sions geared toward biomass energy. 

By contrast, Rousseff has put more 
emphasis on dozens of large-scale hydro- 
electric dams that would increase develop- 
ment pressure on the Amazon rainforest. 
“We need more energy, but Brazil's cur- 
rent strategy is really based on big infra- 
structure,” says Mercedes Bustamante, 
an ecologist at the University of Brasilia 
who spent two years at the science min- 
istry under Rousseff. “I think it’s time for 
a change.” 

Science policy has received little atten- 
tion in the presidential campaign, although 
Silva has called for federal and private 
investments in research and develop- 
ment to expand from 1.1% of Brazil’s gross 
domestic product to 2%. The government’s 
science spending has increased over the 
past decade, but researchers say that it has 
also been spread thin by new programmes. 

Even if she takes the helm, Silva will have 
to tackle shifting political winds in Brasilia. 
In 2012, the nation’s congress voted to 
weaken the country’s forest law, and some 
lawmakers are now pushing to scale back 
protected lands to allow mining and energy 
development. 

But Schwartzman warns against under- 
estimating Silva's political skills should she 
win. “She and the others who worked with 
Chico Mendes in Acre really came through 
a crucible,” he says. “In some ways, looking 
back on that experience, I’m not surprised 
that she is where she is now.’ m 
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| POLICY | 
Chinese science gets 
mass transformation 


Teamwork at centre of Chinese Academy of Sciences reform. 


BY DAVID CYRANOSKI IN BEIJING 


( “that is coming to the institute 
that has been at the heart of China’s 
scientific development since the com- 

munist state began. The Chinese Academy 
of Sciences (CAS) is making unprecedented 
structural reforms to foster collaboration and 
turbocharge research. Proponents say that the 
initiative will make China a world leader in 
areas from neuroscience to particle physics; 
detractors question whether modern China 
needs such a sprawling organization at all. 

The CAS, headquartered in Beijing, employs 
60,000 people, and has 104 research institutes 
and a budget of roughly 42 billion renminbi 
(US$6.8 billion). Just under half comes from 
the government, to cover overhead costs and 
salaries; the rest comes from competitive 
government grants. It is “the biggest reform 
in the academy’s history”, CAS president 
Chunli Bai told Nature’s news team in an inter- 
view on 22 September. 

The initiative — which was designed by 
Bai — follows an appeal by Chinese presi- 
dent Xi Jinping in July 2013 for the CAS to 
become a world leader in science. Although 
China's spending on science now outpaces 
that of all countries except the United States, 
many agree that the country is not getting 
a good return on its investment, in terms 
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of both basic-science breakthroughs and 
commercialization of those findings. 

Some underperformance might result from 
low salaries for scientists — including those at 
the CAS, says Bai — who scramble to supple- 
ment their incomes with multiple grants. “Their 
research becomes very fragmented, he said. 

Scientists also shy away from collaborations 
because co-authorship dilutes their achieve- 
ments in the eyes of grant committees. Ata con- 
ference in 2012, Bai discovered that researchers 
across 20 different CAS institutes were working 
on related projects, but often did not know each 
other — duplicating work and missing oppor- 
tunities to share knowledge. He also notes that 
they had little interest in discussing applications 
with industry representatives. 

To deal with these issues, Bai has grouped 
research into four categories, and promis- 
ing CAS scientists are being selected to work 
together under them. They will see their sala- 
ries jump, relying on grants for only 30% of 
their income, rather than for 70%. The idea is 
to encourage scientists to collaborate on fewer, 
large problems, rather than to churn out mar- 
ginal advances in disparate projects that can be 
used to seek multiple grants. “Teamwork is the 
key word,’ says Mu-Ming Poo, director of the 
CAS Institute of Neuroscience in Shanghai. 

The first category is devoted to establishing 
centres of excellence focused on basic science, 
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and will target areas in which China has a 
chance to dominate. One team, for which 
Poo has already selected 40 researchers 
from 11 different CAS institutes, specializes 
in brain science — one goal is to produce a 
transgenic monkey model that could be use- 
ful for neurodegenerative-disease research. 

Tandong Yao, director of the Institute of 
Tibetan Plateau Research in Beijing will head 
another centre focused on the seismically 
active plateau: his team wants to probe how 
the crust underneath deformed over time. 
Other centres will focus on particle physics, 
thorium-based molten-salt reactors, and 
quantum information. The plan is to take 
the count up to 20 centres by 2020. 

A second category will target areas with 
underdeveloped commercial potential, 
including microsatellites, marine informa- 
tion technology and drug development. A 
third category will establish collaborations 
around large-scale facilities: a synchrotron 
and a protein-science centre, for example, 
both in Shanghai, will be linked to make it 
easier to carry out high-precision protein 
studies and support academic and industry 
researchers, including some from abroad. 

The fourth category, still in its design 
phase, will be devoted to initiatives that 
assist local development and sustainabil- 
ity. Its first will probably focus on work 
by the Institute of Mountain Hazards and 
Environment in Chengdu, which could 
help communities to prepare for natural 
disasters such as landslides. 

But not everyone agrees that the changes 
tackle the key problems at the CAS. One 
Chinese university scientist, who is famil- 
iar with the reforms and did not want to 
be identified, says that the source of the 
problems is that the CAS hires too many 
young scientists who then go through little 
or no review, and essentially receive tenure 
without having to prove themselves. Bai 
counters that the opportunities offered by 
his initiatives will “encourage the under- 
performers to be better”. He says that he 
has steered away from a US-style system in 
which underperformers can be fired: “It is a 
reform with Chinese characteristics.” 

Cong Cao, a science-policy specialist at 
the University of Nottingham, UK, says that 
the CAS is too big and unwieldy. “There's 
no organization in the world with so many 
functions,” he says. He says that universi- 
ties and individual research institutes, 
which are increasingly competing with the 
CAS for leading scientists, can offer a bet- 
ter management environment for scientists. 
“There probably is no reason for the CAS in 
China right now.’ 

But Bai uses the same trait to justify the 
academy's existence: he argues that it allows 
the CAS to make wide contributions to 
China's development. “You have to under- 
stand the history,’ he says. m 


INFECTIOUS DISEASE 


Global Ebola response 
kicks into gear at last 


US and UN mobilizations are welcome but overdue. 


BY DECLAN BUTLER 


r | Nhe international community is 
responding to the Ebola outbreak — at 
long last. On 18 September, the United 

Nations Security Council adopted a resolu- 

tion declaring the outbreak a “threat to inter- 

national peace and security”; two days before, 

the United States had said that it will send 3,000 

military personnel to West Africa and spend 

US$750 million to support civilian efforts. Both 

moves are unprecedented for a disease outbreak 

and just what is needed, say experts. But there 
are fears that quelling the outbreak now will be 
more difficult than if it had been tackled earlier. 

The Ebola outbreak was first detected in 
March, but only the humanitarian group 
Médecins Sans Frontiéres (MSF, or Doctors 
Without Borders) has so far deployed substan- 
tial relief in the affected region, which centres 
on Liberia, Sierra Leone and Guinea (for more, 
see nature.com/ebola). MSF has repeatedly 
demanded that the international community 
launch a major effort to tackle the outbreak. 

The UN resolution calls on member states to 
respond urgently. As it was passed, UN secre- 
tary-general Ban Ki-moon launched the United 
Nations Mission for Ebola Emergency Response 
to deploy resources from UN agencies in coor- 
dination with efforts from member nations and 
donors. The mission’s remit includes “stopping 
the outbreak, treating the infected, ensuring 
essential services, preserving stability and pre- 
venting further outbreaks,” said Ban. 

The United States plans to establish mili- 
tary-run command and control headquarters 
in Liberia and build 17 treatment centres of 
100 beds each in the affected region. It hopes 
to recruit medical staff to run them, and train 
up to 500 health-care workers a week. 

The public-health community tends to 
view military involvement “with suspicion 
and mistrust’, says Adam Kamradt-Scott, a 
health-policy researcher at the University of 
Sydney in Australia. But he welcomes the US 
plan: “Military forces have unique skill sets that 
can assist civilian authorities.” If the operation 
does help to counter Ebola, he adds, it could 
set a new precedent for responding to disease 
outbreaks of international concern. 

Training forces in outbreak response will 
take time, warns Daniel Bausch, who stud- 
ies infectious diseases at Tulane University in 
New Orleans, Louisiana, and is caring for peo- 
ple with Ebola in Sierra Leone. He adds that 


the situation is too dire for the United States 
alone to solve, and that other nations must step 
in, something that the UN resolution should 
catalyse. It was passed by unanimous vote and 
had a high number of member-state sponsors. 
Jean-Clément Cabrol, director of operations 
for MSF in Geneva, Switzerland, says that both 
efforts are welcome, but that the situation has 
worsened in the past few weeks, so speed is 
key. Treatment centres in the three countries 
most affected are completely overwhelmed 
(see page 474). This is 


“Military fuelling the spread of the 
forces have disease, because centres 
unique skill are forced to send away 
sets that can infected people, who 
assist civilian — thencontaminate others. 
authorities.” Conventional public- 


health measures are no 
longer working, says Bausch. It has become 
next to impossible to detect and isolate those 
infected, and to monitor all the people they have 
been in contact with. He recommends focusing 
surveillance on areas next to affected zones to 
prevent further geographical spread of Ebola. 
He adds that international efforts will be effec- 
tive only if they include training for health-care 
staff, for example in biosafety. Money and sup- 
plies are welcome, but by themselves, they are 
not enough. m SEE EDITORIAL P.459 
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@ Population boom to last all century 
go.nature.com/xnsblj 

@ Male astronomers get bigger share of 
Hubble go.nature.com/v9e807 


IMAGE OF THE WEEK 


This map measures dust in the southern 
Galactic hemisphere. It suggests that 

§ a signal of 
gravitational 
waves from 

the Big Bang 
previously 
observed in the 
= black rectangle 
could have been spurious. 
go.nature.com/ifalmw 
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Jie ae (right), president of Shanghai Jiao Tong University awards an honorary doctorate to Peter Salovey, president of Yale University. 


Chinese university 
reform in three steps 


High-quality faculty, valued and rewarded, is the key to 
building a world-class research institution, says Jie Zhang. 


( ‘ater economic growth is slowing 
after 35 years of rapid expansion. 
Sustainable development depends 

on converting that growth into innovation. 

So the Chinese government has substan- 

tially increased its investment in universities 

and research institutes. In 2012, for exam- 
ple, it spent more than 1 trillion renminbi 

(US$161 billion) on research and develop- 

ment and more than 700 billion renminbi 

on higher education. 
As a result, research capacity and pro- 
ductivity have grown. Between 2005 and 

2012, the number of full-time-equivalent 


researchers in China increased by 38% (to 
314,000). Over the same period, the number 
of published research articles from Chinese 
higher-education institutions rose by 54% 
(to 1,117,742) and granted patents went up 
eightfold (to 66,755). 

Yet the quality of research, as indicated 
by citations, lags behind, and technology 
transfer is sluggish. Ossified practices in 
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evaluation and incentivization — such as 
rewarding publication quantity over qual- 
ity — are holding Chinese universities back. 

As president of a Chinese research uni- 
versity, I believe that building a high-quality 
faculty is the key to developing a world-class 
research university. In the past ten years, my 
institution, Shanghai Jiao Tong University 
(SJTU), has created a culture of innova- 
tion and boosted research capacity through 
reforms to career paths for existing and new 
faculty members. 

China’ leading universities are taking steps 
to enhance their research productivity. > 
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A robot developed by the School of Mechanical Engineering at Shanghai Jiao Tong University in China. 


> Setting up new institutes, hiring high- 
profile overseas scholars at high salaries and 
introducing tenure systems are proven short- 
cuts to excellence. But importing a handful of 
‘star’ professors is not sufficient to change uni- 
versity culture. Indeed, they can make local 
faculty members feel overlooked. Instead, 
fundamental reforms are needed. 

My experience at SJTU presents a good 
worked example. We have implemented a 
university-wide incentive system to motivate 
all faculty, staff and students. The goal is to 
develop a high-quality faculty comparable to 
that of the best Western universities by 2020. 

The changes began in 2007 with an 
analysis of future national and global chal- 
lenges, open discussions among faculty 
and staff, and a feasibility review. Priori- 
ties emerged, including engaging creative 
minds, incubating innovations and bridg- 
ing the sciences and humanities to best 
serve the nation and the world. To maintain 
faculty support, fundamental reforms were 
adopted in stages over several years. 


First: recruit and mentor junior staff. 
World-class scientists were hired to build 
research groups focused on cutting-edge 
science and engineering problems and to 
set a high bar for academic performance. 
Junior faculty members were recruited 
competitively in the international job mar- 
ket. Generous start-up funds supported 
the groups for the first few years, until they 
could attract funds from Chinese govern- 
ment agencies or industries, start to produce 
results and establish their reputations. 

A tenure system for new faculty members 
began in 2007. A six-year tenure track was 
set up for new junior faculty based on mid- 
term and final evaluation by an international 
committee. 


In 2008, we launched a mentoring system 
for junior faculty under 35 years old. And 
since then, 1,251 junior faculty members 
have won extra support for their research, 
housing and living costs from a 150-million- 
renminbi endowment. 


Second: three career tracks for faculty. 
Promotion and salary are assessed every three 
years against performance indicators devel- 
oped by each school or department. Three 
career tracks — teaching, research and tenure 
— were created in 2010 for existing faculty. 
The teaching track has no research expecta- 
tion; scientists with no teaching obligation 
must make up part of their income from com- 
petitive research grants. Tenure-track profes- 
sors teach and do research and are evaluated 
according to combined criteria. 

Each track has similar starting salary 
scales. Faculty members chose their own 
track, with the option to switch later if peer 
evaluations allow. All faculty may apply 
for transfer to the tenure-track system as 
standards rise; the best will be encouraged 
to do so. The average salary for faculty has 
increased by 60% in the past four years, and 
will rise further in the next four years. 

The university decentralized its governance 
to give schools and departments more auton- 
omy to recruit, develop and evaluate their 
staff. Budget reforms have given each school 
or department direct access to resources. 


Third: one merged tenure system. The infu- 
sion of fresh academic blood gave the univer- 
sity a split nature: international scholars on the 
Western tenure path; existing faculty follow- 
ing the three tracks. In 2013, six pilot schools 
or departments began to merge the two paths 
into a single tenure system similar to that of 
North American universities. Encouraging 
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results have already been seen in physics, 
mathematics, mechanical engineering, bio- 
medical engineering, law and management. 
On the basis of these experiences, the 
entire university will switch to the tenure 
system between 2015 and 2018. Although it 
is impossible to suddenly double or triple the 
salary scale for all 3,000 faculty members, 
mechanisms for paying competitive salaries 
within the tenure system are being built, 
such as setting up chairs and fellowships 
from a 500-million-renminbi endowment. 
Faculty members who do not qualify 
for the tenure track can choose to leave or 
remain in the contract system until it is 
phased out by 2018. It is important that fac- 
ulty who have served the university well but 
cannot meet the new standards be treated 
fairly and with respect and provided with a 
channel to continue to serve the institution. 


PROVEN RESULTS 

Today, SJTU is well on its way to being one 
of the leading research universities in the 
world. Since 2007, 450 world-class profes- 
sors and top-tier young faculty members of 
international standing have joined the uni- 
versity, and more than 250 existing faculty 
members have transferred to the tenure- 
track system. Eighty-five per cent of faculty 
hold PhDs (up from 50% in 2006). 

The university’s annual revenue has more 
than doubled since 2007, to more than 
7 billion renminbi; competitive research 
income has tripled to more than 2.5 billion 
renminbi. The number of disciplines offered 
at SJTU that are ranked in the global top 1% of 
Thomson Reuters’ Essential Science Indicator 
(ESD jumped from 5 in 2007 to 16 in 2014. 
Social sciences joined engineering, and natu- 
ral, life and medical sciences in the top 1%, 
making the university more comprehensive. 

In terms of number of papers published, 
the university has ranked second globally in 
engineering since 2007. ESI ranked natural 
sciences 40 in 2013, up from 57 in 2007. Life 
sciences and agriculture rose to 43 from 136, 
and medicine is now ranked 27, up from 166. 
Citations are up and more discoveries are 
being patented. 

The reforms at SJTU have promoted a 
shift in educational emphasis. We are mov- 
ing away from knowledge transfer to knowl- 
edge creation and from instruction-centred 
teaching to student-centred learning. Our 
philosophy has changed to nurturing stu- 
dents to be engaged, competent global 
citizens. A culture that values and rewards 
innovation has successfully taken root. = 


Jie Zhang is the president of Shanghai Jiao 
Tong University in Shanghai, China. He is 
an academician of the Chinese Academy 
of Sciences, and a foreign associate of 

US National Academy of Sciences. 

e-mail: jzhang1@sjtu.edu.cn 
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China opens translational 
medicine centre in Shanghai 


First of five linked institutes aims to capitalize on basic-research investments. 


BY DAVID CYRANOSKI 


ver the past decade, China has poured 
() money into research, especially in the 

biomedical sciences. But as the nation’s 
health-care costs have risen in the past few 
years, critics have argued that the investment 
has not paid off. A group of researchers and gov- 
ernment officials now hopes to improve those 
returns with the official opening this month of 
the National Centre for Translational Medicine 
in Shanghai. The 1-billion-renminbi (US$163- 
million) centre, slated to complete construction 
in 2017, is the first of five institutions meant to 
bridge the gap between basic research and clini- 
cal application by putting researchers, doctors 
and patients under one roof. 

China's biologists have made impressive 
progress in fields such as genome sequencing 
and protein-structure analysis, but have pro- 
duced little in terms of drugs or other medi- 
cal products. “Bloggers and others are always 
complaining that China is just burning money,’ 
says Xiao-Fan Wang, a cancer researcher at 
Duke University in Durham, North Carolina, 
and one of the 21 people on the centre’s inter- 
national consulting committee. Some even ask 
why China, which has made progress in indus- 
try largely by copying other countries, has not 
succeeded by following the same strategy in the 
biomedical sciences, says Wang. 

Wang says that doctors in China are over- 
worked, often racing through whole days of 
back-to-back patient visits and procedures. 
And yet, because most Chinese hospitals are 
part of universities, the doctors must publish 
to get promoted. In this competitive environ- 
ment, they often refuse to share data, but are 
rarely able to do thorough research themselves. 
“You can’t expect them to do eight hours of sur- 
gery and then jump into a lab coat,’ says Wang. 
The Shanghai centre will change that, he says, 
by giving clinicians time to focus on research 
questions. 

Saijuan Chen, the centre’s director and a 
geneticist at Shanghai Jiaotong University, says 
that the centre began interviews this month to 
recruit some 50 principal investigators and 
12 scientists to direct research in their disci- 
plines. She says that the institution's focus will 
be on developing treatments for heart disease, 
stroke, metabolic diseases and cancer. 

The centre’s international consulting 
committee will help to recruit top talent in these 


A major push is under way in China to develop new drugs and surgical procedures. 


fields. Tak Mak, a committee member and an 
immunologist at the University of Toronto in 
Canada, says that the strategy shows a commit- 
ment to hiring on the basis of expertise rather 
than connections. It is “an effort to break out of 
the old system — which is probably the system 
in more places than youd care to know’, he says. 


FINE BALANCE 

One challenge will be attracting Chinese 
biomedical researchers who are working 
abroad. Wang says that China has been able 
to lure basic-research scientists back to the 
country with competitive pay. But clinicians 
in China receive a fraction of the salaries they 
can get in the United States. 

Managing the centre may also be difficult, 
says committee member Sujuan Ba, chief oper- 
ating officer of the US National Foundation for 
Cancer Research in Bethesda, Maryland. The 
centre's governing council comprises members 
from ten government agencies and institutions. 
“That shows the wide scope of support from 
China, but it is going to be a huge challenge for 
the leadership team to balance and to manage 
each council member's requisites,” she says. “It 
is very important for the centre to steer away 
from bureaucratic red tape and stay focused on 
its mission and long-term vision of conducting 
high-impact translational research.” 

The 54,000-square-metre site will have 
300 beds for patients and study volunteers. 


It will also run a biobank that will collect 
hundreds of thousands of patient samples, 
and an ‘omics’ centre that will conduct high- 
throughput genome analyses and gather data 
on proteins and metabolic products. 

Haematologist Zhu Chen of Shanghai Jiao 
Tong University, who is China's former health 
minister and chairman of the centre's scien- 
tific advisory board, hopes that the centre will 
emulate the spirit of St Jude Children’s Research 
Hospital in Memphis, Tennessee, with its close 
relationship between clinicians and basic 
researchers. He also emphasizes the need to 
make therapeutic trials free: in China, partici- 
pants are often charged for treatment. Zhu and 
Saijuan Chen, who are married, led one of the 
few successful translational research projects in 
China — treatment ofa form of leukaemia using 
retinoic acid and arsenic trioxide. 

The Shanghai centre will eventually have 
four sister institutions: a geriatrics centre at 
the People’s Liberation Army general hospital 
in Beijing, a centre for rare and refractory dis- 
eases at the Peking Union Medical College in 
Beijing, a molecular-medicine research centre 
at the Fourth Military Medical University in 
Xian and a regenerative-medicine centre at the 
West China Hospital in Chengdu. 

“These centres are at the historic moment to 
make a huge impact for China’s drug develop- 
ment,’ says Ba. “We should be able to see the 
signs of success in five years, if not earlier: m 
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